Preface


This quarterly publication provides archival reports on developments in programs 
managed by JPL’s Office of Telecommunications and Data Acquisition (TDA). In 
space communications, radio navigation, radio science, and ground-based radio and 
radar astronomy, it reports on activities of the Deep Space Network (DSN) in plan- 
ning, supporting research and technology, implementation, and operations. Also 
included are standards activity at JPL for space data and information systems 
and reimbursable DSN work performed for other space agencies through NASA. 
The preceding work is all performed for NASA’s Office of Space Communications 
(OSC). The TDA Office also performs work funded by two other NASA program
offices through and with the cooperation of OSC. These are the Orbital Debris 
Radar Program (with the Office of Space Systems Development) and 21st Century 
Communication Studies (with the Office of Exploration). 

For the High-Resolution Microwave Survey (HRMS), The TDA Progress Report 
reports on implementation and operations for searching the microwave spectrum. 
In solar system radar, it reports on the uses of the Goldstone Solar System Radar 
for scientific exploration of the planets, their rings and satellites, asteroids, and 
comets. In radio astronomy, the areas of support include spectroscopy, very long 
baseline interferometry, and astrometry. These three programs are performed for 
NASA’s Office of Space Science and Applications (OSSA), with OSC funding DSN 
operational support. 

Finally, tasks funded under the JPL Director’s Discretionary Fund and the Cal- 
tech President’s Fund that involve the TDA Office are included. 

This and each succeeding issue of The TDA Progress Report will present material 
in some, but not necessarily all, of the following categories: 

OSC Tasks: 

DSN Advanced Systems 
Tracking and Ground-Based Navigation 
Communications, Spacecraft-Ground 
Station Control and System Technology 
Network Data Processing and Productivity 
DSN Systems Implementation 
Capabilities for Existing Projects 
Capabilities for New Projects 
New Initiatives 

Network Upgrade and Sustaining 
DSN Operations 

Network Operations and Operations Support 
Mission Interface and Support 
TDA Program Management and Analysis 
Ground Communications Implementation and Operations 
Data and Information Systems 
Flight-Ground Advanced Engineering 
Long-Range Program Planning 

OSC Cooperative Tasks: 

Orbital Debris Radar Program 
21st Century Communication Studies 
OSSA Tasks:

High-Resolution Microwave Survey 
Goldstone Solar System Radar 
Radio Astronomy 

Discretionary Funded Tasks 
Feasibility Study of a Ka-/Ka-Band Dichroic Plate
With Stepped Rectangular Apertures 

J. C. Chen 

Ground Antennas and Facilities Engineering Section 


For the Cassini spacecraft mission, a dichroic plate is needed to pass Ka-band 
uplink (34.2 to 34.7 GHz) and to reflect Ka-band downlink (31.8 to 32.3 GHz) for
dual-frequency operation in the Deep Space Network. The special characteristic 
of the Ka-/Ka-band dichroic plate is that the passband and the reflective band 
are only 1.9 GHz (5.7 percent) apart. A thick dichroic plate with stepped rectan- 
gular apertures that function as resonator filters was chosen for the Ka-/Ka-band 
dichroic plate design. The results of the feasibility study are presented in this 
article. 


I. Introduction 

For the existing dichroic plates in the Deep Space Net- 
work (DSN), the passband frequency is approximately four 
times higher than the reflecting-band frequency. For ex- 
ample, the S-/X-band (2.2-GHz/8.45-GHz) dichroic plate 
reflects 2.2 GHz and passes 8.45 GHz, and the X-/Ka-band 
(8.45-GHz/32.0-GHz) dichroic plate reflects 8.45 GHz and 
passes 32.0 GHz. In these cases the sizes of the aper- 
tures are large enough to pass the higher frequency band, 
yet small relative to the wavelength of the reflective fre- 
quency, resulting in efficient reflection. However, if the 
passband and reflective band are close [as they are in 
the Ka-/Ka-band (31.8- to 32.3-GHz/34.2- to 34.7-GHz) 
dichroic plate], the apertures on the dichroic plate may be 
large enough to pass the higher frequency passband, but 
may not be small enough to cut off the lower frequency 
adequately. A dichroic plate with straight rectangular 
apertures may not satisfy this requirement. Therefore, a 


dichroic plate with stepped rectangular apertures, which 
act as filters, was chosen for the Ka-/Ka-band dichroic 
plate feasibility study. 
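The band spacing described above can be checked with a few lines of arithmetic. The following is a minimal sketch in Python; the function names are hypothetical, and normalizing the 1.9-GHz gap by the frequency midway between the two band edges is an assumption made here to reproduce the quoted 5.7 percent.

```python
def passband_to_reflect_ratio(f_reflect_ghz, f_pass_ghz):
    """Ratio of the passband frequency to the reflecting-band frequency."""
    return f_pass_ghz / f_reflect_ghz


def band_gap(f_reflect_top_ghz, f_pass_bottom_ghz):
    """Return (gap in GHz, gap in percent of the mid-gap frequency).

    Normalizing by the mid-gap frequency is an assumption of this sketch.
    """
    gap = f_pass_bottom_ghz - f_reflect_top_ghz
    midpoint = 0.5 * (f_pass_bottom_ghz + f_reflect_top_ghz)
    return gap, 100.0 * gap / midpoint


# Existing DSN plates: passband is roughly four times the reflecting band.
s_x_ratio = passband_to_reflect_ratio(2.2, 8.45)
x_ka_ratio = passband_to_reflect_ratio(8.45, 32.0)

# Ka-/Ka-band plate: downlink band top edge 32.3 GHz, uplink band bottom edge 34.2 GHz.
gap_ghz, gap_percent = band_gap(32.3, 34.2)

print(f"S-/X-band ratio:  {s_x_ratio:.2f}")
print(f"X-/Ka-band ratio: {x_ka_ratio:.2f}")
print(f"Ka-/Ka-band gap:  {gap_ghz:.1f} GHz ({gap_percent:.1f} percent)")
```

The contrast between a ratio near four and a gap of a few percent is what rules out a simple cutoff-based design with straight apertures.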

II. Analysis 

It is assumed that the dichroic plate is illuminated by 
linearly polarized plane waves with oblique incidence, in 
this case 30 deg from normal. The analysis of the di- 
chroic plate with stepped rectangular apertures is based 
on a modal matching method. The electromagnetic field 
in the free-space region is represented by Floquet modes 
and the field in the waveguide region is represented by 
rectangular waveguide modes. By applying the boundary 
condition at the junctions and using the modal match- 
ing method, the scattering matrix of the dichroic plate is 
then obtained [1]. Two computer programs, the Thick
Frequency Selective Surface With Rectangular Apertures
program¹ and the Rectangular Waveguide program,² were
integrated in order to analyze the dichroic plate with 
stepped rectangular apertures. The Thick Frequency Se- 
lective Surface With Rectangular Aperture program cal- 
culates the scattering matrix of the free space and wave- 
guide junction, and the rectangular waveguide program 
computes the scattering matrix of the stepped waveguide 
region. These matrices are then combined to form the 
scattering matrix of the dichroic plate with stepped rect- 
angular apertures. The computer code, the Thick Fre- 
quency Selective Surface With Stepped Rectangular Aper- 
tures program, was used to design a Ka-/Ka-band dichroic 
plate. Because multiple design parameters were involved, 
an optimization program was utilized to speed up the de- 
sign process. 

The number of modes representing the electromagnetic 
fields on either side of the junction has to be sufficient 
to ensure the convergence in the modal matching method 
[2]. There is an equation in the program in which the
number of Floquet modes used increases as the number 
of waveguide modes increases. The speed of the conver- 
gence versus the number of waveguide modes may vary 
depending on the structure of the stepped waveguide. For 
a straight rectangular aperture, a total of 40 rectangular 
waveguide modes is sufficient [3], but in this particular 
design, a total of 180 rectangular waveguide modes was 
necessary. Figure 1 shows how the transmission coeffi- 
cient at 34.7 GHz fluctuates as the number of waveguide 
modes varies. The highest waveguide number indicates 
that the waveguide numbers M and N are not greater 
than that number. For example, if the highest waveguide 
number is equal to one, the $\mathrm{TE}_{01}$, $\mathrm{TE}_{10}$, $\mathrm{TE}_{11}$, and $\mathrm{TM}_{11}$
rectangular waveguide modes are used to expand the elec- 
tromagnetic field in the waveguide region. Figure 1 indi- 
cates that the convergence is achieved when the highest 
waveguide numbers reach nine (180 waveguide modes) or 
higher. 
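The mode counts quoted above follow from enumerating the rectangular waveguide mode indices. The sketch below is an illustration, not the program used in the study; it assumes the usual convention that TE modes allow either index (but not both) to be zero, while TM modes require both indices to be at least one. The function names are hypothetical.

```python
def rectangular_mode_indices(highest):
    """List TE and TM mode indices (M, N) with M and N not greater than `highest`."""
    te = [(m, n)
          for m in range(highest + 1)
          for n in range(highest + 1)
          if (m, n) != (0, 0)]          # TE00 does not exist
    tm = [(m, n)
          for m in range(1, highest + 1)
          for n in range(1, highest + 1)]  # TM modes need both indices >= 1
    return te, tm


def mode_count(highest):
    """Total number of TE plus TM modes for a given highest waveguide number."""
    te, tm = rectangular_mode_indices(highest)
    return len(te) + len(tm)


for highest in (1, 4, 9):
    print(highest, mode_count(highest))
```

Under this convention a highest waveguide number of one gives the four modes listed in the text, four gives the 40 modes cited for a straight aperture, and nine gives the 180 modes used in this design.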

III. Design and Theoretical Performance

The Ka-/Ka-band dichroic plate design is an array of 
stepped rectangular apertures on a 5.6388-mm grid with a
60-deg skew angle. Each stepped aperture consists of five
steps (Fig. 2). The stepped aperture is a length of rectangular
waveguide ($A_1 \times B_1$ = 4.6126 mm × 4.6228 mm)


1 The Thick Frequency Selective Surface With Rectangular Aperture 
program was written by J. C. Chen, Jet Propulsion Laboratory, 
Pasadena, California, and was submitted to COSMIC, University 
of Georgia, 382 E, Broad St., Athens, Georgia 30602. 

2 The Rectangular Waveguide Program was written by D. J. Hoppe, 
Radio Frequency and Microwave Subsystems Section, and F. Man- 
shadi, Ground Antennas and Facilities Engineering Section, Jet 
Propulsion Laboratory, Pasadena, California. 


with two thin irises ($A_2 \times B_2$ = 3.81 mm × 3.9268 mm,
thickness $I$ = 0.508 mm) inside. The irises divide the
waveguide into three sections with lengths of 9.8044 mm
($L_1$), 9.144 mm ($L_2$), and 9.8044 mm. The first and the
last sections act as resonators with low quality factor $Q$,
while the center section acts as a resonator with high $Q$.
The size of the main waveguide aperture is smaller than 
the cutoff size of the highest frequency at the downlink and 
larger than the cutoff size of the lowest frequency of the 
uplink. The stepped aperture is basically a resonator filter 
to pass the uplink and reflect the downlink. The advantage 
of having only two different waveguide sizes is to reduce 
the fabrication costs. Since multiple metal sheets can be 
stacked together to be wire-electrical-discharge machined 
for an identical pattern, only two sets of sheets need to be 
run through the machine. 
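The cutoff argument above can be illustrated with the dominant-mode cutoff of an air-filled rectangular waveguide, $f_c = c/(2a)$. This is a simplified sketch: it treats each aperture as an isolated guide and ignores the oblique incidence and array coupling that the full modal analysis accounts for. The function names are hypothetical.

```python
C_MM_GHZ = 299.792458  # speed of light expressed in mm*GHz


def dominant_cutoff_ghz(width_mm):
    """Cutoff frequency of the TE10 (or TE01) mode for a guide of the given width."""
    return C_MM_GHZ / (2.0 * width_mm)


def cutoff_width_mm(freq_ghz):
    """Guide width at which the dominant mode is exactly at cutoff for freq_ghz."""
    return C_MM_GHZ / (2.0 * freq_ghz)


main_aperture_mm = {"A1": 4.6126, "B1": 4.6228}
iris_mm = {"A2": 3.81, "B2": 3.9268}

for name, width in {**main_aperture_mm, **iris_mm}.items():
    print(f"{name} = {width} mm -> cutoff {dominant_cutoff_ghz(width):.2f} GHz")

print(f"cutoff width at 32.3 GHz: {cutoff_width_mm(32.3):.4f} mm")
print(f"cutoff width at 34.2 GHz: {cutoff_width_mm(34.2):.4f} mm")
```

With these assumptions the main aperture cuts off between the top of the downlink band and the bottom of the uplink band, while the irises are below cutoff across both bands, which is consistent with their role as coupling elements between resonator sections.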

Theoretical performance of the Ka-/Ka-band dichroic 
plate design was calculated and is shown in Table 1. The 
transmission of the Ka-band downlink is about −36 to −48
dB over the bandwidth. The downlink is almost totally re- 
flected, but there is a relative phase shift of 7.26 ±0.77 deg 
over the bandwidth between two orthogonal linear polar- 
izations (TE and TM polarizations). The phase shift be- 
tween two polarizations can be compensated for by the 
polarizer, which is connected to the horn in the microwave 
system. The transmission of the passband (uplink) is better
than −0.19 dB over the bandwidth for both TE polarization
and TM polarization. The relative phase shift of
the passband is 9.45 ±1.00 deg over the Ka-band uplink. 
The curve of the magnitude of the transmission coefficients 
for TE and TM polarizations from 31 to 36 GHz shows 
nice roll-off from reflective band to passband (Fig. 3). The 
transmission coefficient over the passband is shown in more 
detail in Fig. 4. The relative phase shift between TE 
and TM polarizations varies less than ±1.00 deg in both 
downlink and uplink (Fig. 5). Since the incident wave in 
the Deep Space Network is circularly polarized (CP), the 
transmitted CP power is about 0.01 to 0.15 dB with el- 
lipticity of 1.3 to 1.65 dB over the passband, and total 
reflected power is below −37 dB with ellipticity of 2.09 dB
(Figs. 6 and 7). 
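The circular-polarization figures can be approximated from the TE and TM entries of Table 1. The sketch below assumes a perfectly circular incident wave of unit power, resolves the transmitted field into co-polarized and cross-polarized circular components, and reports total transmitted power and ellipticity (axial ratio) in dB. The function name and the sign convention for the TE-to-TM phase are assumptions; the axial ratio does not depend on that sign.

```python
import cmath
import math

# (frequency GHz, TE dB, TM dB, relative phase deg) from the uplink rows of Table 1
UPLINK = [
    (34.2, -0.102, -0.193, 10.468),
    (34.5, -0.013, -0.062, 10.495),
    (34.7, -0.025, -0.168, 8.450),
]


def cp_response(te_db, tm_db, rel_phase_deg):
    """Return (total transmitted power in dB, ellipticity in dB) for CP incidence."""
    te = 10.0 ** (te_db / 20.0)
    tm = 10.0 ** (tm_db / 20.0) * cmath.exp(1j * math.radians(rel_phase_deg))
    co_pol = (te + tm) / 2.0      # same sense of rotation as the incident wave
    cross_pol = (te - tm) / 2.0   # opposite sense, created by the TE/TM imbalance
    total_power = abs(co_pol) ** 2 + abs(cross_pol) ** 2
    axial_ratio = (abs(co_pol) + abs(cross_pol)) / (abs(co_pol) - abs(cross_pol))
    return 10.0 * math.log10(total_power), 20.0 * math.log10(axial_ratio)


for freq, te_db, tm_db, phase in UPLINK:
    power_db, ellipticity_db = cp_response(te_db, tm_db, phase)
    print(f"{freq} GHz: power {power_db:.3f} dB, ellipticity {ellipticity_db:.2f} dB")
```

At the three tabulated frequencies this gives a transmission loss of roughly 0.04 to 0.15 dB and an ellipticity of roughly 1.3 to 1.6 dB, in line with the ranges quoted in the text; the ellipticity is driven mainly by the relative phase shift rather than by the small amplitude difference.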

A tolerance study was also done over the downlink and 
uplink bandwidths (Table 2). The large aperture dimensions
($A_1$ and $B_1$), small aperture dimensions ($A_2$ and
$B_2$), length of each step ($L_1$ and $L_2$), and thickness of the
iris-like step were examined in the tolerance study. One
dimension was varied by ±0.0127 mm (or ±0.0254 mm) at 
a time, while the rest of the dimensions were kept at the 
design size. The last row in Table 2 is the performance 
at the design dimensions without any variation. For ex- 
ample, the transmission coefficient for TM polarization 
changes from 0.10 ±0.09 (tenth row, seventh column) to
0.21 ±0.20 dB (fourth row, seventh column) over the bandwidth
when $B_1$ is decreased by 0.0127 mm. The performance
of the dichroic plate is more sensitive to changes
in the dimensions of the large apertures than the small 
apertures. Changing the length of the apertures has little 
effect on the performance of both uplink and downlink. 
Overall, variation in the dimensions of the dichroic plate 
with stepped apertures has less impact on the performance 
of the downlink than the uplink. 
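The "value ± spread" entries used in the text and in Table 2 appear to be the midpoint of a quantity over the band plus or minus half its range. That reading is an inference, not something the article states; the sketch below tests it against the three downlink phase values of Table 1, with a hypothetical helper name.

```python
def centre_and_half_range(values):
    """Return (midpoint, half-range) of a sequence of values sampled over a band."""
    low, high = min(values), max(values)
    return 0.5 * (high + low), 0.5 * (high - low)


# Relative phase shift (deg) at 31.8, 32.0, and 32.3 GHz from Table 1
downlink_phase_deg = [8.033, 7.505, 6.496]

centre, half_range = centre_and_half_range(downlink_phase_deg)
print(f"downlink relative phase: {centre:.2f} +/- {half_range:.2f} deg")
```

The downlink result agrees with the 7.26 ±0.77 deg quoted for the design dimensions. The uplink values in Table 1 give a slightly different result from the quoted 9.45 ±1.00 deg, presumably because the article's figure was computed from a finer frequency sampling than the three tabulated points.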


IV. Conclusion 

The analysis of the performance of a dichroic plate with 
stepped apertures indicates that it is a feasible choice for 
the Ka-/Ka-band dichroic plate. The stepped aperture is 
a new technique in dichroic plate design; therefore, the fabrication
technique and experimental method require further
study. Also, the accuracy of the new program needs to be 
verified. Fabrication of a test Ka-/Ka-band dichroic plate 
is the next step in understanding its performance. 
Table 1. Transmission coefficients for TE and TM polarizations
and relative phase shift between TE and TM polarizations for a
Ka-/Ka-band dichroic plate with stepped rectangular waveguide.

Ka-band     Frequency,   Transmission coefficients, dB   Relative phase
            GHz          TE           TM                 shift, deg

Downlink    31.8         -47.94       -46.24             8.033
            32.0         -44.27       -42.56             7.505
            32.3         -38.28       -36.46             6.496
Uplink      34.2         -0.102       -0.193             10.468
            34.5         -0.013       -0.062             10.495
            34.7         -0.025       -0.168             8.450


Table 2. Tolerance study for Ka-/Ka-band dichroic plate with stepped rectangular apertures.

                               Ka-band downlink (31.8 to 32.3 GHz)        Ka-band uplink (34.2 to 34.7 GHz)
Dimension of                   reflection [b]                             transmission [c]
stepped          Variation,
aperture, mm     mm [a]        TE, dB       TM, dB       Δ phase, deg     TE, dB       TM, dB       Δ phase, deg

$A_1$, 4.6126    +0.0127       0.00 ±0.00   0.00 ±0.00   4.98 ±1.19       0.03 ±0.02   0.10 ±0.09   1.92 ±0.05
                 -0.0127       0.00 ±0.00   0.00 ±0.00   9.43 ±0.39       0.09 ±0.08   0.06 ±0.05   21.12 ±2.05
$B_1$, 4.6228    +0.0127       0.00 ±0.00   0.00 ±0.00   9.35 ±0.41       0.06 ±0.05   0.08 ±0.08   20.38 ±2.54
                 -0.0127       0.00 ±0.00   0.00 ±0.00   5.27 ±1.08       0.06 ±0.04   0.21 ±0.20   0.80 ±0.44
$A_2$, 3.8100    +0.0127       0.00 ±0.00   0.00 ±0.00   7.23 ±0.78       0.06 ±0.05   0.10 ±0.09   7.99 ±0.06
                 -0.0127       0.00 ±0.00   0.00 ±0.00   7.30 ±0.75       0.06 ±0.04   0.10 ±0.09   1.07 ±1.23
$B_2$, 3.9268    +0.0127       0.00 ±0.00   0.00 ±0.00   7.34 ±0.73       0.06 ±0.03   0.11 ±0.10   11.21 ±1.14
                 -0.0127       0.00 ±0.00   0.00 ±0.00   7.22 ±0.79       0.06 ±0.04   0.11 ±0.11   8.19 ±0.81
$I$, 0.5080      +0.0127       0.00 ±0.00   0.00 ±0.00   7.26 ±0.77       0.06 ±0.04   0.10 ±0.10   9.52 ±1.04
                 -0.0127       0.00 ±0.00   0.00 ±0.00   7.27 ±0.77       0.06 ±0.05   0.10 ±0.09   9.40 ±0.98
$L_1$, 9.8044    +0.0254       0.00 ±0.00   0.00 ±0.00   7.26 ±0.77       0.05 ±0.04   0.09 ±0.08   9.55 ±1.06
                 -0.0254       0.00 ±0.00   0.00 ±0.00   7.27 ±0.77       0.06 ±0.05   0.11 ±0.10   9.42 ±0.98
$L_2$, 9.1440    +0.0254       0.00 ±0.00   0.00 ±0.00   7.27 ±0.77       0.06 ±0.05   0.10 ±0.09   9.39 ±1.01
                 -0.0254       0.00 ±0.00   0.00 ±0.00   7.26 ±0.77       0.05 ±0.04   0.10 ±0.09   9.52 ±1.00
                 0.000 [d]     0.00 ±0.00   0.00 ±0.00   7.26 ±0.77       0.06 ±0.04   0.10 ±0.09   9.45 ±1.00

[a] One dimension of the aperture is varied by ±0.0127 mm (or ±0.0254 mm), while the other dimensions of the aperture remain
the same size.

[b] The third and fourth columns indicate the magnitude of the reflection coefficients over the downlink bandwidth for TE and
TM polarizations, respectively, and the fifth column indicates the relative phase shift between these two polarizations over
the downlink bandwidth.

[c] The sixth and seventh columns indicate the magnitude of the transmission coefficients over the uplink bandwidth for TE
and TM polarizations, respectively, and the eighth column indicates the relative phase shift between these two polarizations
over the uplink bandwidth.

[d] This row indicates the performance of the Ka-/Ka-band dichroic plate at the design dimensions with no variations.

Fig. 3. Transmission coefficient versus frequency for Ka-/Ka-band
dichroic plate with stepped rectangular apertures for TE and TM 
polarizations. 



Fig. 4. Transmission coefficient versus frequency for Ka-/Ka-band 
dichroic plate with stepped rectangular apertures for TE and TM 
polarizations at Ka-band uplink. 




Fig. 6. Transmission power versus frequency for Ka-/Ka-band 
dichroic plate with stepped rectangular apertures for circularly 
polarized wave incidence. 



Fig. 7. Ellipticity versus frequency for Ka-/Ka-band dichroic plate
with stepped rectangular apertures for circularly polarized wave. 


Fig. 5. Relative phase shift versus frequency for Ka-/Ka-band 
dichroic plate with stepped rectangular apertures. 
A Technique for Computation of Noise Temperature
Due to a Beam Waveguide Shroud 

W. Veruttipong and M. M. Franco 
Ground Antennas and Facilities Engineering Section 


Direct analytical computation of the noise temperature of real beam waveguide
(BWG) systems, including all mirrors and the surrounding shroud, is an extremely
complex problem and virtually impossible to achieve. Yet the DSN antennas are
required to be ultra-low-noise in order to be effective, and a reasonably accurate
prediction is essential. This article presents a relatively simple technique to compute
a real BWG system noise temperature by combining analytical techniques
with data from experimental tests. Specific expressions and parameters for X-band
(8.45-GHz) BWG noise computation are obtained for DSS 13 and DSS 24, now
under construction. These expressions are also valid for various conditions of the
BWG feed systems, including horn sizes and positions, and mirror sizes, curvatures,
and positions. Parameters for S- and Ka-bands (2.3 and 32.0 GHz) have not been
determined; however, those can be obtained following the same procedure as for
X-band.


I. Introduction 

Noise temperature due to a BWG assembly is one of the 
major contributors to total antenna receive system noise
temperature, especially for an ultra-low-noise system or a 
system with high spillover power in the BWG shroud. A 
reasonably accurate prediction of the BWG assembly noise 
temperature is essential. Direct analytical computation of 
the noise temperature of real BWG systems, including all 
mirrors, is an extremely complex problem and virtually 
impossible to achieve. 

This article presents a new technique that combines an 
analytical approach with data from measurement tests to 
construct a specific expression with some parameters to 
compute noise temperature in a real BWG system, includ- 
ing all mirrors. Appendix A gives the needed relationships 


among reflector material conductivity, resistivity, loss, and 
resultant noise. Experimental tests that have helped to 
establish this technique are presented in Appendix B. Ap- 
pendix C develops the effective noise temperatures, as seen 
by spillovers in the upper and lower portions of the BWG 
assembly. Appendix D contains physical explanations of 
the behavior of the field and noise temperature in the 
BWG shroud. The results from this technique have been 
used to predict BWG noise temperatures for DSS-13 Phase 
II and DSS 24. 

II. Characteristics of the Electromagnetic 
Fields Inside a BWG Shroud 

The total RF power originating from a horn aperture 
(viewed in transmission, for convenience) and propagating 
through a BWG shroud ($P_{BWG}$) can be separated into two
parts (see Fig. 1)

$$P_{BWG} = P_m + P_{spill} \tag{1}$$

where $P_m$ is the majority of the total power (usually >95
percent of $P_{BWG}$) that is always confined inside all BWG
mirrors; it does not contact the BWG wall and there are
no multiple reflection, diffraction, or creeping wave components.
The power $P_m$ reflects from each and every mirror
once before leaving the BWG shroud through the opening
near $F_1$. $P_m$ can be computed easily because BWG
wall and mirror interactions are small and ignored, which
drastically simplifies the problem. The power that is confined
inside the lower mirrors M6 (or M5, M4, etc.) is
not considered as $P_m$ because leaks occur along the way
up from M6 to M1. For most practical cases where the
total spillover loss is less than 5 percent, the total power
confined inside the last mirror M1 is used as an approximation
of $P_m$.

$P_{spill}$ is the sum of the spillover powers of each mirror. The
$P_{spill}$ fraction creeps and bounces around the BWG walls,
mirrors, brackets (behind the mirrors), edges, etc.,
and suffers dissipation loss and consequent noise. On
average, the $P_{spill}$ power largely dissipates before a small
remainder exits the BWG opening near $F_1$ (some power
may get back into the horn, but most is dissipated, with
consequent transformation into noise). Even though $P_{spill}$
can be computed accurately ($P_{spill} = P_{BWG} - P_m$), its field
distribution and its chaotic behavior inside the lossy BWG
shroud are virtually impossible to compute analytically.

III. Noise Temperature Contributions Inside 
the BWG Shroud 

From Eq. (1), the corresponding noise temperatures are 

$$T_{BWG} = T_m + T_{spill} \tag{2}$$

where $T_{BWG}$ is the total noise temperature (in kelvins)
due to the BWG (including the shroud, mirrors, brackets,
etc.). The values $T_m$ and $T_{spill}$ are the noise temperature
contributions from $P_m$ and $P_{spill}$, respectively. Because of
the simplicity of $P_m$, its corresponding noise temperature
$T_m$ can be computed with acceptable accuracy. The details
are given in Appendix A. From Appendix A [Eq. (A-6)],
the noise temperature $T_m$ for DSS 13 or DSS 24 in kelvins
at X-band is

$$T_m = 0.734\,P_m/P_{BWG} = 0.734\,\alpha_m \tag{3}$$

where $\alpha_m$ is the $P_m$ fraction of $P_{BWG}$, dimensionless.

Similarly, because of the complexity of the field that
contributes to $P_{spill}$, it is not possible to analytically compute
$T_{spill}$. Unfortunately, $T_{spill}$ is usually a major contribution
to $T_{BWG}$ (for example, DSS-13 Phase II, X-band
$T_{spill} \approx 3.0$ K and $T_m \approx 0.7$ K) and a reasonably accurate
prediction is essential. The following section shows a
technique used to compute $T_{spill}$.


IV. A Technique for $T_{spill}$ Computation

Noise temperature due to spillover power $P_{spill}$ is given
in a very simple form as

$$T_{spill} = (P_1/P_{BWG})\,T_1 + (P_2/P_{BWG})\,T_2 = \alpha_1 T_1 + \alpha_2 T_2 \tag{4}$$

where $P_1$ is the total spillover power of the two mirrors
(M5 and M6) in the basement and $P_2$ is the total
spillover power of the four mirrors (M1, M2, M3, and
M4) above the basement ceiling. The values $\alpha_1$ and $\alpha_2$ are
the normalized powers (with respect to $P_{BWG}$) of $P_1$ and
$P_2$, respectively. The values $T_1$ and $T_2$ are source (or sink)
effective noise temperatures seen by $P_1$ and $P_2$, in kelvins,
respectively. In other words, $T_1$ and $T_2$ are effective noise
temperatures that $\alpha_1$ and $\alpha_2$ see in the BWG assembly (including
the shroud, mirrors, brackets, etc.). For example,
if the BWG shroud were very short and highly conducting,
$T_1$ and $T_2$ would be equal and approximately $T_{sky}$. If the
shroud were very long and lossy, $T_1$ and $T_2$ would again
be equal and approximately $273.16 + T_{Celsius}$ (≈290 K).

If one substitutes Eqs. (3) and (4) into Eq. (2), the
BWG shroud noise temperature at X-band becomes

$$T_{BWG} = 0.734\,\alpha_m + \alpha_1 T_1 + \alpha_2 T_2 \tag{5}$$

Note that $\alpha_m + \alpha_1 + \alpha_2 = 1.0$.

The value $T_{BWG}$ was measured at DSS 13. The power
fractions $\alpha_m$, $\alpha_1$, and $\alpha_2$ in Eq. (5) are known and the
unknown quantities are $T_1$ and $T_2$. By performing various
measurements and perturbations at X-band, the coefficients
$T_1$ and $T_2$ at X-band have been obtained (see
details in Appendices B and C).

DSS 13, no basement shroud:

$$T_1 = 300 \pm 10 \text{ K}, \qquad T_2 = 240 \pm 45 \text{ K} \tag{6a}$$

DSS 24, full shroud:

$$T_1 = 280 \pm 20 \text{ K}, \qquad T_2 = 230 \pm 45 \text{ K} \tag{6b}$$

Note that the antennas in Eqs. (6a) and (6b) both have steel shrouds
(the same conductivity as DSS 13) above the basement
ceiling, while DSS 24 has an additional large aluminum
shroud in the basement.

In general, the coefficients ($T_1$, $T_2$) apply to any BWG
antenna with a similar shroud structure and (nearly) the
same conductivity. The same coefficients are valid for various
conditions in a BWG system, including the horn sizes
and positions, and mirror sizes, curvatures, and positions.
See Appendix D for more physical explanations about the
behavior of field and noise temperature due to the BWG
assembly.

The effective temperatures $T_1$ and $T_2$ for S- and Ka-bands
are expected to be slightly smaller and larger than
the X-band coefficients, respectively. The accurate values
of the S- and Ka-band coefficients have not been determined.
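Equation (5) and the X-band coefficients of Eq. (6) can be combined in a short calculation. The sketch below is illustrative only: the spillover fractions passed to the function are hypothetical values chosen so that the spillover term is close to the 3.0 K quoted for DSS-13 Phase II, and the function and dictionary names are not from the article.

```python
T_M_COEFFICIENT_K = 0.734  # Eq. (3): mirror-resistivity noise per unit main-beam fraction

# Nominal X-band effective temperatures (T1, T2) in kelvins from Eqs. (6a) and (6b)
EFFECTIVE_TEMPERATURES_K = {
    "DSS 13, no basement shroud": (300.0, 240.0),
    "DSS 24, full shroud": (280.0, 230.0),
}


def bwg_noise_temperature(alpha_1, alpha_2, t1_k, t2_k):
    """Evaluate Eq. (5); returns (T_m, T_spill, T_BWG) in kelvins.

    alpha_1: spillover fraction of the basement mirrors (M5, M6)
    alpha_2: spillover fraction of the mirrors above the ceiling (M1 to M4)
    """
    alpha_m = 1.0 - alpha_1 - alpha_2          # fractions sum to 1.0
    t_m = T_M_COEFFICIENT_K * alpha_m
    t_spill = alpha_1 * t1_k + alpha_2 * t2_k  # Eq. (4)
    return t_m, t_spill, t_m + t_spill


# Hypothetical spillover fractions, for illustration only
alpha_1_example, alpha_2_example = 0.005, 0.00625

t1, t2 = EFFECTIVE_TEMPERATURES_K["DSS 13, no basement shroud"]
t_m, t_spill, t_bwg = bwg_noise_temperature(alpha_1_example, alpha_2_example, t1, t2)
print(f"T_m = {t_m:.2f} K, T_spill = {t_spill:.2f} K, T_BWG = {t_bwg:.2f} K")
```

Even with total spillover of about 1 percent, the spillover term dominates the mirror-resistivity term by a factor of roughly four, which is why the accuracy of $T_1$ and $T_2$ matters more than that of the 0.734 coefficient.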


V. Conclusion 

A technique for computing a BWG assembly noise tem- 
perature has been established and the technique can now 
be applied to the low-noise DSN BWG antennas. Spe- 
cific expressions and parameters for X-band BWG noise 
temperature computation of DSS 13 and DSS 24 are ob- 
tained from Eqs. (5) and (6). These expressions are valid 
for various conditions of the BWG feed system, including 
the horn sizes and positions, and the mirror sizes, curva- 
tures, and positions. The coefficients in Eq. (6) are not 
sensitive to small variations of BWG shroud structure (di- 
ameter, length, etc.), shroud conductivity, or operating 
frequency. Parameters for S- and Ka-bands have not been 
determined; however, they will not be dramatically differ- 
ent and can be obtained by following the same procedure 
as for X-band. The BWG noise temperature computed by 
this technique should be, at best, as accurate as a measurement
result (plus some small error for the $P_{spill}$ computation)
and should be valid for all engineering applications.
Appendix A

Noise Temperature Due to BWG Mirror Surface Resistivity

Noise temperature due to one reflection of a wave normally
incident on a mirror surface is¹

$$T = \frac{4R_s}{Z_0}\,T_p\,\alpha, \text{ kelvins} \tag{A-1}$$

$$R_s = 0.02\pi\sqrt{\frac{f}{10\,\sigma_N}}, \text{ ohms/square} \tag{A-2}$$

where

$R_s$ = surface resistivity of the mirror, ohms/square
$f$ = frequency, GHz
$\sigma_N$ = normalized conductivity = $\sigma \times 10^{-7}$, mho/meter
$Z_0$ = free-space wave impedance, ohms
$T_p$ = physical temperature, kelvins
$\alpha$ = power fraction illuminating the mirror

For a circularly polarized wave with an average incident
angle to the mirror equal to $\theta$, an approximate expression
for $T$, valid where $\theta$ is away from grazing incidence, is

$$T = \frac{2R_s}{Z_0}\,T_p\,\alpha\left(\cos\theta + \frac{1}{\cos\theta}\right) \tag{A-3}$$

The total noise temperature of $n$ mirrors is

$$T = \frac{2T_p}{Z_0}\sum_{i=1}^{n} R_{si}\,\alpha_i\left(\cos\theta_i + \frac{1}{\cos\theta_i}\right) \tag{A-4}$$

where $R_{si}$, $\alpha_i$, and $\theta_i$ are $R_s$, $\alpha$, and $\theta$ of the $i$th mirror,
respectively.

In order to compute $T_m$ [as shown in Eq. (2) for DSS 13
or DSS 24], the noise temperature in Eq. (A-4) can be used
with the following conditions:

(1) The surface conductivity is the same for all six mirrors
with $\sigma = 2.3 \times 10^7$ mho/meter (for 6061T6
aluminum).

(2) The power fraction is also the same for all mirrors
and equal to $\alpha_m$, where $\alpha_m$ is the main fraction of
$P_{BWG}$ ($\alpha_m = P_m/P_{BWG}$).

(3) The incident angle is $\theta$ = 45 deg for the four mirrors
(M1, M2, M3, and M4) above the ceiling and
$\theta$ = 30 deg for the two basement mirrors (M5 and
M6).

(4) The physical temperature $T_p$ = 290 K.

(5) The frequency $f$ = 8.45 GHz will be used because
all measured data in Appendix C are at 8.45 GHz.

From Eq. (A-4) and the above five conditions, the total
noise temperature due to surface resistivity of all six
mirrors of DSS 13 or DSS 24 becomes

$$T_m = \frac{2T_p R_s}{Z_0}\,\alpha_m\left\{2\left(\cos 30 + \frac{1}{\cos 30}\right) + 4\left(\cos 45 + \frac{1}{\cos 45}\right)\right\} \tag{A-5}$$

Substitute $T_p = 290$, $Z_0 = 120\pi$, $f = 8.45$, and $\sigma_N = 2.3$
into Eqs. (A-2) and (A-5), and one obtains

$$T_m = 0.734\,\alpha_m \tag{A-6}$$

¹ D. A. Bathker, “Planewave Reflection Noise Temperature Due to
Surface Resistivity,” JPL Interoffice Memorandum 3328-92-0144
(internal document), Jet Propulsion Laboratory, Pasadena, California,
August 20, 1992.


Note that in a reasonable design with a m = 0.95 or 
more, the dissipation of six aluminum BWG mirrors alone 
causes a noise component of about 0.7 kelvin at X-band. 
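The 0.734 coefficient of Eq. (A-6) can be reproduced numerically from the five conditions listed in this appendix. The sketch below assumes the standard skin-effect expression for surface resistivity, $R_s = \sqrt{\pi f \mu_0/\sigma}$ in SI units, which may be written differently from Eq. (A-2) in the source; the function names are hypothetical.

```python
import math

MU_0 = 4.0e-7 * math.pi     # permeability of free space, H/m
Z_0 = 120.0 * math.pi       # free-space wave impedance, ohms
T_PHYSICAL_K = 290.0        # condition (4)


def surface_resistivity(freq_ghz, sigma_mho_per_m):
    """Skin-effect surface resistivity in ohms/square (assumed standard formula)."""
    return math.sqrt(math.pi * freq_ghz * 1.0e9 * MU_0 / sigma_mho_per_m)


def mirror_noise_k(r_s, alpha, theta_deg):
    """Eq. (A-3): noise of one mirror for a circularly polarized wave."""
    cos_theta = math.cos(math.radians(theta_deg))
    return 2.0 * T_PHYSICAL_K * r_s * alpha / Z_0 * (cos_theta + 1.0 / cos_theta)


def main_beam_coefficient():
    """Eq. (A-5) with alpha_m = 1: four mirrors at 45 deg and two at 30 deg."""
    r_s = surface_resistivity(8.45, 2.3e7)   # conditions (1) and (5)
    incident_angles_deg = [45.0] * 4 + [30.0] * 2
    return sum(mirror_noise_k(r_s, 1.0, theta) for theta in incident_angles_deg)


coefficient = main_beam_coefficient()
print(f"T_m / alpha_m = {coefficient:.3f} K")
```

Under these assumptions the six-mirror sum evaluates to about 0.734 K per unit main-beam fraction, matching Eq. (A-6), and setting the incident angle to zero in `mirror_noise_k` recovers the normal-incidence form of Eq. (A-1).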
Appendix B

BWG Noise Temperature Experimental Tests 


T. Y. Otoshi and M. M. Franco performed several mea- 
surements at DSS 13 in order to gain a better understand- 
ing of the BWG assembly noise temperature. The results 
show a higher than (initially) expected noise temperature 
(which was about 4 K). By adding skirts around the rim of 
the BWG mirrors in these tests, the noise temperature was 
reduced only a fraction of a kelvin. The two mirrors in the 
basement normally do not have shrouds. After construct- 
ing a temporary shroud (made with lightweight insulating 
material, aluminum foil-coated on two sides, and a foam 
center) that partially enclosed the two mirrors, the noise 
temperature dropped only 0.25 K. 

The above results led to further investigation by 
W. Veruttipong and M. M. Franco. The experimental tests 
described below (performed by Veruttipong and Franco 
between May 31 and June 3, 1991) help to explain why the 
BWG noise temperature is higher than initially expected. 
More important, their results led to a new technique used 
to compute the BWG noise temperature. 

Experiment 1. The purpose of this experiment was 
to observe the effect of the basement shroud on system 
noise temperature. 

A tightly fitted and fully enclosed basement shroud was 
made from a partially enclosed shroud left over from Otoshi
and Franco’s experiment (see Fig. B-1). The same foil
material was used to patch all the opening spaces. Alu- 
minum tape was used to seal all small gaps. J. Withington 
built an RF probe to check for any RF signal leaks around 
the new shroud. The JPL standard 22-dB X-band corru- 
gated horn was used, and its aperture was positioned at 
3.34 in. above F3 throughout the experiments. The fol- 
lowing system noise temperatures were recorded: 

Fully enclosed basement shroud: $T_{op}$ = 33.62 K

Partially enclosed basement shroud: $T_{op}$ = 33.85 K ($\Delta$ = 0.23 K above the fully enclosed case)

No basement shroud: $T_{op}$ = 34.10 K ($\Delta$ = 0.25 K above the partially enclosed case)


Conclusion 1 . The results clearly show that the base- 
ment shroud does not significantly reduce the BWG noise 
temperature. In other words, the spillover field in the base- 
ment still sees a high effective noise temperature (close to 
300 K) with or without the basement shroud. 

Experiment 2. The purpose of this experiment was 
to reconfirm the results from Experiment 1 that spillover 
power sees a high effective noise temperature in the BWG 
shroud. 

The main field inside the BWG shroud ($P_m$) was excited
so that it had a characteristic similar to $P_{spill}$ in
Fig. 1. By causing mirrors M5 and M6 to be rippled (by
putting curly aluminum tape all over the mirror surfaces),
the field inside the shroud was obviously chaotic and similar
to Fig. E-1(c). The tightly fitted basement shroud
was still in place and the noise temperature reading was 
235.5 K. 

Conclusion 2. The simulated scattered field is not 
quickly guided by the shroud to the sky. In fact, it bounces 
around and effectively creates a high dissipation loss. The 
235.5 K does not represent any specific parameter. It is 
just an indicator showing that the simulated scattered field 
(or spillover field in the real BWG) sees an effective noise 
temperature on the order of 200 K. The accurate value is 
computed in Appendix C. Earlier in the experiment, it was 
not fully understood that the effective noise temperature 
for spilled or scattered power was approximately $T_{sky}$, and
that is why the measured noise temperature was higher 
than expected. 

Experiment 3. The purpose of this test was to see 
whether there would be any unusual noise temperature 
measurements when the shroud was short-circuited at various
locations. The intent was to see whether the noise temperature
readings were high (300 K ±Δ), medium (150 K
±Δ), or low (50 K ±Δ). Therefore, a very high-precision
experimental setup was not necessary. 

The short circuit was done by closing the BWG tube 
with a foam-backed foil disk. Five of the six mirror surfaces 
were intentionally rippled (see Experiment 2) and the noise 
temperature readings were 

Short circuit near $F_1$: $T_{op,1}$ = 296 K

Short circuit at the ceiling: $T_{op,2}$ = 304 K

Short circuit just above the cement: $T_{op,3}$ = 304 K

Short circuit near (all mirror surfaces are good): $T_{op,4}$ = 131 K

Conclusion 3. All readings (except $T_{op,4}$) are 300 K ±5 K, as
expected, because a perfectly closed cavity has an ambient noise
temperature. The slightly lower reading of $T_{op,1}$ results from an
imperfect short circuit near $F_1$ due to a relatively strong wind. It is
noted that if all the mirror surfaces are good during the measurement,
the noise temperatures will differ greatly, depending upon the location
of the short circuit. For example, if the short circuit is exactly at the
waist of the beam (near $F_1$ or $F_2$), the noise temperature will be low
(less than 100 K) because most of the energy will get back to the horn.
The noise temperature $T_{op,4}$ is not less than 100 K because the
short-circuit location is not at the waist of the field and the
short-circuit disk is neither perfectly level nor flat.


Experiment 4. The purpose of this test was to measure the effect of a
section of concrete in the shroud (see the location of the concrete in
Fig. B-1).

To investigate the impact of a concrete surface segment 
(unlined shroud portion) on the BWG noise temperature, 
the BWG system was returned to its normal operational 
condition, i.e., no rippled surfaces, no short circuit, and 
no basement shroud. One-inch foam-backed foil was used 
to cover the concrete surface. System noise temperatures 
were recorded with and without the covering. The result 
shows practically no change in the noise temperature. 

Conclusion 4. The concrete in the BWG shroud is not 
a major source of BWG shroud noise and the concrete does 
not create extra noise temperature in the BWG system. 
This was expected, as the concrete section is near a beam 
waist. 



Fig. B-1. DSS-13 BWG antenna with a temporary basement shroud 

Appendix C

Effective Noise Temperatures ($T_1$ and $T_2$)


The noise temperature coefficients or effective noise temperatures, as
shown in Eq. (4), will be evaluated at X-band for DSS 13 and DSS 24.
Recall Eq. (5),

$T_{BWG} = 0.734\,\alpha_m + \alpha_1 T_1 + \alpha_2 T_2$

The coefficients for DSS 13 will be considered first. The measurement at
DSS 13 reveals $T_{BWG} = 8.9 \pm 0.4$ K [1]. The spillover power of every
mirror for DSS 13 is computed, without the BWG wall, by using Physical
Optics (PO) software, and the power is normalized to the total horn
output power. The results are

$\alpha_m = 0.9694$

$\alpha_1 = 0.0138$    (C-1)

$\alpha_2 = 0.0168$


Since there is no shroud in the basement of DSS 13, $\alpha_1$ is mostly
dissipated in the basement; therefore,

$T_1 = 300$ K    (C-2)

Substitute $T_{BWG} = 8.9$ K, Eqs. (C-1) and (C-2) into Eq. (5), and one
obtains

$T_2 = 240$ K    (C-3)

For the deviation of $T_{BWG}$ of ±0.4 K and the spillover power within
±0.0005 for $\alpha_1$ and ±0.0010 for $\alpha_2$, the deviation of $T_2$
becomes ±45 K. It is noted that the value of $T_1 = 300$ K is quite well
known and is an accurate value (a deviation of ±10 K may be applicable).
Therefore, the accuracy of $T_2$ is nearly as good as the measurement can
get (plus a small error in PO computation).

Noise temperature coefficients for DSS 24 will be considered next. From
Experiment 1 in Appendix B, the difference in noise temperatures of
DSS 13 with and without a basement shroud is about 0.5 K. DSS 24 has a
larger basement shroud than the one in the test (Fig. B-1), which results
in a slightly higher system noise temperature (about 0.1 to 0.3 K
higher). However, the smaller gap between mirrors and the BWG wall of
DSS 24 helps to reduce the noise temperature by about 0.2 K. On balance,
it is reasonable to say that the result from DSS 13 with the shroud
should approximately represent DSS 24. Recall that with the temporary
basement shroud, the DSS 13 noise temperature decreased by 0.5 K. One can
assume that 0.3 K is related to $\alpha_1$ and 0.2 K is related to
$\alpha_2$ (gap effect included). Even though this assumption of the ratio
contribution may be questionable, the total noise temperature difference
of 0.5 K is always preserved.

From Eqs. (5) and (C-1), one can write

$\Delta T_1 = 0.3/\alpha_1 = 0.3/0.0138$

$\Delta T_2 = 0.2/\alpha_2 = 0.2/0.0168$    (C-4)

From Eqs. (C-2), (C-3), and (C-4), the coefficients for DSS 24 are

$T_1 = 300 - 20 = 280$ K

$T_2 = 240 - 10 = 230$ K    (C-5)

The deviation of $T_1$ for DSS 24 should be increased to ±20 K due to a
small uncertainty (~ ±0.1 K) of 0.5-K power distribution. The deviation
of $T_2$ for DSS 24 should be approximately the same as DSS 13, which is
about ±45 K.
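
As a check on the arithmetic in this appendix, the short Python sketch below solves Eq. (5) for $T_2$ using the DSS 13 values of Eqs. (C-1) and (C-2), and then evaluates the DSS 24 adjustments of Eq. (C-4). The function and variable names are illustrative assumptions and are not part of the original analysis. The unrounded results come out close to the rounded figures quoted in the text (240 K, 20 K, and 10 K).

```python
# Illustrative check of the Appendix C arithmetic (names are assumptions).
T_BWG = 8.9        # K, measured BWG noise temperature at DSS 13
ALPHA_M = 0.9694   # normalized main-field power, Eq. (C-1)
ALPHA_1 = 0.0138   # normalized spillover power tied to T1, Eq. (C-1)
ALPHA_2 = 0.0168   # normalized spillover power tied to T2, Eq. (C-1)
T1_DSS13 = 300.0   # K, Eq. (C-2)


def solve_t2(t_bwg, alpha_m, alpha_1, alpha_2, t1):
    """Solve Eq. (5), T_BWG = 0.734*alpha_m + alpha_1*T1 + alpha_2*T2, for T2."""
    return (t_bwg - 0.734 * alpha_m - alpha_1 * t1) / alpha_2


t2_dss13 = solve_t2(T_BWG, ALPHA_M, ALPHA_1, ALPHA_2, T1_DSS13)

# Eq. (C-4): temperature shifts assigned to the 0.3-K and 0.2-K contributions.
delta_t1 = 0.3 / ALPHA_1
delta_t2 = 0.2 / ALPHA_2

print(f"T2 (DSS 13)  = {t2_dss13:.1f} K")
print(f"Delta T1     = {delta_t1:.1f} K")
print(f"Delta T2     = {delta_t2:.1f} K")
print(f"T1 (DSS 24) ~= {T1_DSS13 - delta_t1:.0f} K before rounding the shift")
print(f"T2 (DSS 24) ~= {t2_dss13 - delta_t2:.0f} K before rounding the shift")
```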


Reference 

[1] T. Y. Otoshi, S. R. Stewart, and M. M. Franco, “Portable Microwave Test
Packages for BWG Antenna Performance Evaluations,” IEEE Trans. Microwave
Theory Tech., vol. 40, no. 6, June 1992.

Appendix D

Field and Noise Temperature Due to a BWG Shroud 


The spillover power past a mirror is defined here as a power originating
from a single source that is not incident on the mirror in the absence of
a BWG wall and other mirrors. In reality, with BWG walls and all mirrors
included, the spillover power of each mirror will spread (bounce around)
nonuniformly throughout the shroud. The dissipation losses are associated
with the conductivities of the walls, mirrors, brackets, etc., in a very
complex and undetermined combination. Total spillover noise temperature
is the sum of the contributions of all the mirrors. Besides the
complexities of the scattered fields related to the mirror brackets,
edges, bolts and nuts, welding scars, gaps between mirrors and BWG walls,
etc., one must also deal with a partially open cavity problem of the
lossy BWG shroud. Based on over five years' experience in fundamental
research with reflection, diffraction, and creeping waves of various
surfaces and objects, it is concluded that the characteristic of
$P_{spill}$ (and $T_{spill}$) is so chaotic in a very lossy and complex
system that it is not possible to compute analytically.

One can draw a similarity between the expression
$T_{spill} = \alpha_1 T_1 + \alpha_2 T_2$ in Eq. (4) and Ohm's law
$V = IR$. The resistance $R$ (in ohms) is a result of thermal agitation of
electrons, which prevents current (free electrons) from moving freely
through the resistor. The relation $V = IR$ applies for a macroscopic
region in an electric circuit, and $R$ is obtained from the relation
$R = V/I$. It is extremely difficult to compute $R$ from a microscopic
point of view by computing the total energy loss from continual
collisions of electrons or by determining an average rate of flow of free
electrons through the thermally excited lattice structure of the
resistor. The resistance $R$ is linear and isotropic within a wide range
of $I$ (similarly, $T_1$ and $T_2$ are linear and perhaps isotropic). The
linearity of $T_1$ (similarly for $T_2$) in Eq. (4) physically means that
the behavior (or the degree of chaos) of $\alpha_1$ is approximately the
same, regardless of the magnitude of $\alpha_1$ (as long as there is no
dielectric breakdown).

The reason why $T_{spill}$ in Eq. (4) is separated into two parts (above
and below the subterranean room ceiling) is that the shroud in the
basement is very different from the shroud above. In fact, DSS 13 does
not have a basement shroud at all. Fortunately, this helps to improve the
accuracy of Eq. (6a) because $T_1 = 300$ K is an accurate effective noise
temperature for a very large basement room, provided that the exit “hole”
is a small area, as compared with the total basement surface area. This
leads to an accurate determination of $T_2$ because it is the only
remaining unknown. Adding small perturbations with the guidance from
measurement results leads to $T_1$ and $T_2$ of DSS 24, as shown in
Eq. (6b). Suppose that a horn were located at $F_2$ and there were no
basement. Then the coefficient $T_1$ would not exist, and $T_2$ would be
smaller than the one given in Eq. (6b) because there would be no (close
to) room temperature load below $F_2$.

For low spillover loss in a BWG shroud, the total antenna system noise
temperature is not sensitive to a small variation (say, ±20 K) of $T_1$
or $T_2$. For example, for DSS-13 Phase II X-band with a total system
noise temperature of 35 K and normalized spillover power
$\alpha_{spill} = 0.012$, ±20 K variations in $T$ resulted in only
0.012(±20) = ±0.24 K. Also, for an extremely low spillover loss (one
mirror spill <0.05 percent), the noise temperature computation may not
give a sensible result because of the possibility of inaccurate spillover
computation.

A self-dual (48,24;12) code comes from restricting a binary cyclic (63,18;36)
code to a 6 x 7 matrix, adding an eighth all-zero column, and then adjoining six
dimensions to this extended 6 x 8 matrix. These six dimensions are generated by
linear combinations of row permutations of a 6 x 8 matrix of weight 12, whose
sums of rows and columns add to one. A soft decoding using these properties
and approximating maximum likelihood is presented here. This is preliminary to a
possible soft decoding of the box (72,36;15) code that promises a 7.7-dB theoretical
coding gain under maximum likelihood.


I. A Self-Dual (48,24;12) Code 

Consider the BCH (63,18;24) code of length 63 generated by the recursion
polynomial $f_1(x) f_3(x) f_{-1}(x)$, where $f_1(x) = x^6 + x + 1$ with a
root $\beta$ that is a primitive generator of the 63rd roots of unity in
GF(64). Here $f_3(x)$ and $f_{-1}(x)$ contain $\beta^3$ and $\beta^{-1}$
as roots, respectively. Restrict the values of the code to the
coordinates $9i + 7j$ for $0 \le i \le 6$, $j = 1, 2, 4, 5, 7, 8$. Thus, a
(42,18;12) code has been constructed. To prove this, one examines the
matrix in a Mattson-Solomon (MS) polynomial formulation over the rows.

For $z = xy$, where $x^7 = 1$, $y^9 = 1$, $x = \beta^{9i}$, and
$y = \beta^{7j}$, indexing the rows by $y$, the MS polynomial for each
row is

$P_y(x) = \mathrm{Tr}\,[(C_1 y + (C_1 y)^8)\,x + (C_3 y^3 + C_3^8 y^6)\,x^3 + (C_{-1}^2 y^7 + C_{-1}^{16} y^2)\,x^5]$

This polynomial becomes, in the Solomon-McEliece $\Gamma_2$ formulation,

$P = \mathrm{Tr}\,[(C_1 y + (C_1 y)^8)\,x + (C_3^2 y^6 + C_3^{16} y^3 + C_{-1}^8 y + C_{-1} y^8)\,x^6]$

Thus, the coefficient of $x$ is seen to be a (6,2;5) code over GF(8),
while the coefficient of $x^6$ is a (6,4;3) code over GF(8). The minimum
binary weight of the six rows is ≥ 10. Now, summing $\Gamma_2$ over the
rows, one can see that this adds to 0, yielding weights that are
multiples of 4, and thus proving that the minimum distance of the code is
≥ 12. Note that the sum of the rows is the (7,3;4) codeword given by
$\mathrm{Tr}\,(C_3^2 + C_3^{16})\,x^6$.

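Adjoin an eighth column to this 6 x 7 matrix. Now add six more dimensions
by forming linear combinations of all cyclic row permutations of the
single matrix

$\begin{pmatrix}
0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 \\
1 & 1 & 1 & 1 & 1 & 1 & 1 & 0
\end{pmatrix}$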
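
The stated properties of this 6 x 8 matrix (weight 12, with every row sum and column sum equal to one modulo 2) can be checked mechanically. The sketch below also checks that the six cyclic row permutations are linearly independent over GF(2), so that they can contribute six dimensions. The matrix entries are my reading of the printed array, and the helper names are hypothetical.

```python
# Illustrative check; BASE is the 6 x 8 matrix as read from the text (an assumption).
BASE = [[0, 0, 0, 0, 0, 0, 0, 1] for _ in range(5)] + [[1, 1, 1, 1, 1, 1, 1, 0]]


def cyclic_row_permutations(matrix):
    """All cyclic shifts of the row order of a matrix."""
    return [matrix[k:] + matrix[:k] for k in range(len(matrix))]


def flatten(matrix):
    """Pack a 0/1 matrix into one integer, reading the rows left to right."""
    return int("".join(str(bit) for row in matrix for bit in row), 2)


def gf2_rank(vectors):
    """Rank over GF(2) of bit vectors given as integers."""
    basis = []
    for v in vectors:
        for b in basis:
            v = min(v, v ^ b)  # clears the leading bit of b from v when present
        if v:
            basis.append(v)
    return len(basis)


weight = sum(map(sum, BASE))
row_sums = [sum(row) % 2 for row in BASE]
col_sums = [sum(col) % 2 for col in zip(*BASE)]
rank = gf2_rank([flatten(m) for m in cyclic_row_permutations(BASE)])
print(weight, row_sums, col_sums, rank)
```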


The newly constructed code of length 48 and dimension 24 has a minimum
distance of 12. The dimension 23 code coming from pairs of rows with
weight 24 is easily seen to have distance 12 and row sums equal to zero.
For the 24th dimension, whose row sums are odd, one need only check that
certain weight patterns in the dimension 18 code did not exist.

To verify the results, note that $\sum_j \beta^{7j} = 0$ for
$j = 1, 2, 4, 5, 7, 8$, and investigate the weight forms in any row
permutation (6,6,6,6,6,6), (6,6,6,6,2,2), (6,6,6,6,4,0), (2,2,2,2,2,2),
etc., to verify that this addition does not alter the basic minimum
distance and self-orthogonality.

II. Soft Decoding This Code 

To decode this code using soft-decision information, first assume that
the rows are of even parity. There are six (8,7;2) binary codes with the
coefficients of $x$ forming a (6,2;5) code over GF(8). There are 64 such
possibilities, and these are stored as six (8,3;4) codes that are cyclic
extensions of the maximal-length shift register codes $\mathrm{Tr}\,cx$,
$x = \beta^i$, where $0 \le i \le 6$. Adding these six codewords to a
received word, six extended BCH-Hamming (8,4;4) words are left to be
maximum-likelihood decoded. These words must have coefficients of $x^6$
that form a (6,4;3) pseudo-code. Conducting fifteen trials where four
words are assumed correct and generating the rest of the words will give
a set of soft-decoded values. Thus, in 64 x 15 trials a candidate emerges
for maximum likelihood decoding. This technique will correct all
hard-decision five-error patterns, and almost all six-error patterns.
Assuming an odd parity in the eighth column, one uses
2 x 64 x 15 = 1920 trials in total. How close this is to maximum
likelihood is an as yet unanswered question.

The 64 words are generated by taking the recursion $x^6 + x + 1$ to
generate six linearly independent words, placing the word in the 9 x 7
matrix as prescribed, and then limiting each codeword to the rows 1, 2,
4, 5, 7, and 8. This will give six generators of the (6,2;5) code and so
will yield 64 words. For the (6,4;3) code, generate the cyclic code
formed by $(x^6 + x^4 + x^2 + x + 1)(x^6 + x^5 + 1)$. Generate 12 linearly
independent words as above and limit each codeword to the rows 1, 2, 4,
5, 7, and 8 to give 12 generators of the (6,4;3) code over GF(8). This is
the form desired.


III. Analysis of Performance 

How close is this to maximum likelihood performance? Performance here
consists of maximum likelihood decoding of the six BCH-Hamming (8,4;4)
codes and assumes at least four rows are correct. This clearly will not
work if three or more of the decoded rows are incorrect. This is the key
factor to decoding correctly. If $p$ is the decoding error probability
under maximum likelihood decoding of the (8,4;4) code, then the decoding
error is

$\sum_{i=3}^{6} \binom{6}{i}\, p^i (1 - p)^{6-i}$

This is roughly $20p^3$ for the entire code.
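
To see how good the $20p^3$ approximation is, the following Python sketch evaluates the probability that three or more of the six row decodings fail, assuming (as a simplification) that the six rows fail independently with the same probability $p$, and compares it with the leading term. The function names are illustrative.

```python
from math import comb


def block_error_probability(p, rows=6, min_bad_rows=3):
    """Probability that at least min_bad_rows of the row decodings are wrong,
    assuming independent rows that each fail with probability p."""
    return sum(
        comb(rows, i) * p**i * (1 - p) ** (rows - i)
        for i in range(min_bad_rows, rows + 1)
    )


def leading_term(p, rows=6, min_bad_rows=3):
    """Dominant term for small p: C(6,3) * p^3 = 20 p^3."""
    return comb(rows, min_bad_rows) * p**min_bad_rows


for p in (1e-1, 1e-2, 1e-3):
    exact = block_error_probability(p)
    approx = leading_term(p)
    print(f"p = {p:g}: exact = {exact:.4e}, 20 p^3 = {approx:.4e}")
```

The approximation tightens as $p$ decreases, since the neglected terms are of order $p^4$.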


IV. Soft Decoding the (72,36;15) Code 

To decode the code in [1] using soft-decision information, first assume
that the rows are of even parity. There are nine binary (8,7;2) codes
with coefficients of $x^6$ forming a maximal-distance-separable (MDS)
(9,3;7) code over GF(8). There are 128 such possibilities, and these are
stored as nine (8,3;4) codes that are cyclic extensions of the
maximal-length shift register codes $\mathrm{Tr}\,cx^6$, $x = \beta^i$,
where $0 \le i \le 6$. Adding these nine codewords to a received word,
nine extended BCH-Hamming (8,4;4) words are left to be
maximum-likelihood decoded. These decoded words are now symbols that are
coefficients of $x$ that form a (9,6;4) code over GF(8). Eighty-four
trials, where six symbols are assumed correct to generate the rest of the
symbols, will give a set of values for soft decoding. Thus, in
128 x 84 = 10,752 trials, there emerges a candidate for maximum
likelihood decoding. This technique will correct all hard-decision
seven-error patterns and almost all eight-error patterns. Assuming an odd
parity in the eighth column, there are 2 x 10,752 = 21,504 total
examinations.
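
The trial counts quoted for the two decoders follow from simple counting: the number of stored candidates times the number of ways to choose the rows (or symbols) assumed correct, doubled to cover the odd-parity case. The sketch below reproduces that count; the function name and argument names are illustrative assumptions.

```python
from math import comb


def trial_count(stored_candidates, words, words_assumed_correct, parity_cases=2):
    """Return (trials for one parity assumption, trials for both parities)."""
    per_parity = stored_candidates * comb(words, words_assumed_correct)
    return per_parity, parity_cases * per_parity


# (48,24;12) code: 64 candidates, 4 of 6 words assumed correct.
print(trial_count(64, 6, 4))
# (72,36;15) code: 128 candidates, 6 of 9 symbols assumed correct.
print(trial_count(128, 9, 6))
```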


Reference 

[1] G. Solomon, “Self-Dual (48,24; 12) Codes,” The Telecommunications and Data 
Acquisition Progress Report 42-111, vol. July-September, pp. 75-79, Novem- 
ber 15, 1992. 

A (72,36;15) box code is constructed as a 9 x 8 matrix whose columns add to
form an extended BCH-Hamming (8,4;4) code and whose rows sum to odd or even
parity. The newly constructed code, due to its matrix form, is easily decodable
for all seven-error and many eight-error patterns. The code comes from a slight
modification in the parity (eighth) dimension of the Reed-Solomon (8,4;5) code over
GF(512). Error correction uses the row sum parity information to detect errors,
which then become erasures in a Reed-Solomon correction algorithm.


I. The Code Construction 

The first 27 dimensions of the codes constructed basically constitute a
(63,27;16) code represented as a 9 x 7 matrix. This arises from a
Reed-Solomon (RS) cyclic (7,3;5) code over GF(512). The last nine
dimensions are constructed by modifying the construction of the extended
RS (8,4;5) code over GF(512). For eight of the nine dimensions, this is
exactly the extended Reed-Solomon code. For the ninth dimension, the
encoding algorithm is modified. Encoding is direct and systematic.
Decoding the code uses error/erasure techniques as discussed in [1].


II. A Modified Reed-Solomon (8,4;5) Code
Over GF(512)

Recall first that the extended RS (8,4;5) code over GF(8) represented in
binary form in a normal basis is isomorphic to the extended Golay
(24,12;8) code. However, the modified extended RS (8,4;5) code over
GF(64) represented in binary form is a self-dual (48,24;12) code. These
were shown in [1]. In both these codes, the decoding must sometimes go
through eight such trials corresponding to an ambiguity of elements in
GF(8). The decoding procedure here will exhibit the same ambiguity for
one special case of seven errors that appear as erasures.

Using techniques similar to those in [1], if one starts with an RS
(8,4;5) code over GF(512), and represents the code in binary using a
particular normal basis with the special property defined below, one can
generate a code of length 72 and dimension 36 with even weights that are
multiples of 4 and odd weights of the form $4m - 1$.

The binary representation of the usual RS (8,4;5) code over GF(512)
yields nine (8,7;2) codewords whose decomposition into two cyclic code
components and a constant component looks, respectively, like (9,7;3) and
(9,2;8) RS or maximal-distance-separable (MDS) codes over GF(8) and a
binary (9,9;1) code. However, the code here is constructed by modifying
the extended coding rule for the parity symbol.

In particular, let $\gamma$ be a root of the polynomial
$f(x) = x^9 + x^8 + x^6 + x^5 + x^4 + x + 1$, where $\gamma$ is a
primitive generator of the 511th roots of unity. Represent the elements
of GF(512) in the normal representation using the roots of $f(x)$. The
roots are $\gamma^j$, where $j \in J$ for
$J = \{1, 2, 4, 8, 16, 32, 64, 128, 256\}$.

Note that for this particular choice of $f(x)$, one has

$\mathrm{Tr}\,\gamma^j = 1$, $j \in J$, $J = \{1, 2, 4, 8, 16, 32, 64, 128, 256\}$

$\mathrm{Tr}\,\gamma^i \gamma^k = 0$, $i \ne k$, $i, k \in J$

Let (3 be a root of the polynomial ^(x) = x 3 -f + 1. 
Here $\beta$ is an element of GF(8), a subfield of GF(512), and

$\beta = \gamma^{73}$.

A. Encoding 

Now use the recursion or check polynomial
$h(x) = \prod_{i=0}^{3} (x + \beta^i)$ to generate an RS (7,4;4) code over
GF(512). This means that the initial shift register contains four
elements in GF(512) expressed as coefficients in the normal
representation above. The cyclic portion of the code is of length seven,
and the overall parity symbol, the eighth dimension, is defined
differently. Representing the binary code as components
$\mathrm{Tr}\,P(x)\gamma^i$, where $i$ = 1, 2, 4, 8, 16, 32, 64, 128, 256,
extend the codes to the eighth coordinates by the rule: the binary value
at the row indexed by the $i$th coordinate is given by
$\mathrm{Tr}\,C_0\gamma^i + \mathrm{Tr}\,\sum_{j \in J} C_0\gamma^j$.

Thus, for the constant term $C_0$ with $\mathrm{Tr}\,C_0 = 0$, this symbol
behaves like the normal parity symbol, which is a sum over the values of
the cyclic code coordinates.

The general Mattson-Solomon (MS) polynomial of a codeword $a$ is
$P_a(x) = C_0 + C_1 x + C_2 x^2 + C_3 x^3$, where $C_i \in GF(512)$ for
$0 \le i \le 3$ and $x \in GF(8)$. Encode the codeword in its cyclic
portion. The extended codeword $a$ expressed in terms of the MS
polynomial is

$a = (P_a(\beta^i),\ 0 \le i \le 6,\ P_a(0))$

Writing the codewords in binary and using the normal basis $\gamma^j$ for
$j \in J$ above, there are nine binary codewords of length eight

$\mathrm{Tr}\,P(x)\gamma^j$, $j$ = 1, 2, 4, 8, 16, 32, 64, 128, 256

where $\mathrm{Tr}\,a$ denotes the value in GF(2) given by the trace of an
element $a \in GF(512)$

$\mathrm{Tr}\,a = a + a^2 + a^4 + a^8 + a^{16} + a^{32} + a^{64} + a^{128} + a^{256}$


Consider one of the nine binary words in its Mattson-Solomon setting,

$\mathrm{Tr}\,P_a(x)\gamma^j = \mathrm{Tr}\,(C_0 + C_1 x + C_2 x^2 + C_3 x^3)\gamma^j$

$\quad = \mathrm{Tr}\,C_0\gamma^j + \mathrm{Tr}'\,[(C_1 x + C_2 x^2 + C_3 x^3)\gamma^j]$

$\quad\quad + \mathrm{Tr}'\,[((C_1 x + C_2 x^2 + C_3 x^3)\gamma^j)^8 + ((C_1 x + C_2 x^2 + C_3 x^3)\gamma^j)^{64}]$

$\mathrm{Tr}'\,a = a + a^2 + a^4$, $a \in GF(8)$
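
The decomposition above rests on the identity $\mathrm{Tr}\,a = \mathrm{Tr}'(a + a^8 + a^{64})$, where $a + a^8 + a^{64}$ lies in the subfield GF(8). The sketch below checks this numerically for every element of GF(512). It builds the field with the modulus $x^9 + x^4 + 1$, which is a stand-in chosen here for convenience and is not the polynomial $f(x)$ of this article; the identity itself does not depend on the choice of irreducible modulus. All function names are illustrative.

```python
# GF(512) elements are 9-bit integers; bit i is the coefficient of x^i.
MOD = 0b1000010001  # x^9 + x^4 + 1, an assumed stand-in modulus (not f(x))


def gf_mul(a, b):
    """Multiply two GF(512) elements modulo MOD."""
    result = 0
    while b:
        if b & 1:
            result ^= a
        b >>= 1
        a <<= 1
        if a & 0x200:  # degree reached 9: reduce
            a ^= MOD
    return result


def gf_pow(a, e):
    """Raise a GF(512) element to a nonnegative integer power."""
    result = 1
    while e:
        if e & 1:
            result = gf_mul(result, a)
        a = gf_mul(a, a)
        e >>= 1
    return result


def trace_512(a):
    """Tr a = a + a^2 + a^4 + ... + a^256, a value in GF(2)."""
    total, term = 0, a
    for _ in range(9):
        total ^= term
        term = gf_mul(term, term)
    return total


def to_gf8(a):
    """a + a^8 + a^64, which lies in the subfield GF(8)."""
    return a ^ gf_pow(a, 8) ^ gf_pow(a, 64)


def trace_8(e):
    """Tr' e = e + e^2 + e^4 for e in GF(8)."""
    return e ^ gf_pow(e, 2) ^ gf_pow(e, 4)


ok = all(trace_8(to_gf8(a)) == trace_512(a) for a in range(512))
print("Tr a == Tr'(a + a^8 + a^64) for all a:", ok)
```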

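Set $C_0 = 0$ temporarily, as this does not affect the arguments to
follow. Then,

$\mathrm{Tr}\,P(x)\gamma^j = \mathrm{Tr}'\,[(C_1\gamma^j + (C_1\gamma^j)^8 + (C_1\gamma^j)^{64}$

$\quad + (C_2\gamma^j)^{256} + (C_2^{256}\gamma^{256j})^8 + (C_2^{256}\gamma^{256j})^{64})\,x]$

$\quad + \mathrm{Tr}'\,[((C_3\gamma^j)^2 + (C_3\gamma^j)^{16} + (C_3\gamma^j)^{128})\,x^6]$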

Lemma. The coefficient of $x$ is a (9,6;4) code over GF(8). The
coefficient of $x^3$, and consequently of $x^6$, is a (9,3;7) code over
GF(8). The code is indexed by the values of $\gamma^j$, where
$j \in J = \{1, 2, 4, 8, 16, 32, 64, 128, 256\}$.

Proof. The set $\gamma^j$, where
$j \in J = \{1, 2, 4, 8, 16, 32, 64, 128, 256\}$, can only take zero
values one less than the number of terms in the coefficients of $x$ and
$x^6$. An argument that clarifies this follows. Consider the coefficient
of $x^3$. This is a polynomial of degree 64 for which, if $\gamma$ is a
solution for a value $C_3$, then $\lambda\gamma$ is also a solution for
all $\lambda \in GF(8)$. Thus, there can only be at most two values of
$\gamma$ that make the coefficient of $x^3$ equal to zero and
consequently the coefficient of $x^6$ equal to zero. A similar
examination of the degrees in the coefficient of $x$ will yield the above
result. The term $\mathrm{Tr}\,C_0\gamma^j$ in the code's expression when
$\mathrm{Tr}\,C_0 = 0$, i.e., the constant terms, forms a binary (9,8;2)
code.

Theorem. The RS code determined by codewords with MS polynomials
$P_a(x)$ and $\mathrm{Tr}\,C_0 = 0$ forms a binary (72,35;16) code with
weights that are multiples of 4.

Proof. The multiple-of-4 property of the weights using the
Solomon-McEliece $\Gamma_2$ Formula follows:

$\mathrm{Tr}\,P(x)\gamma = \mathrm{Tr}\,[(C_1\gamma + (C_2\gamma)^4)\,x + (C_3\gamma)^2 x^6]$

where Tr is defined in GF(64).

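Now

$\Gamma_2(\mathrm{Tr}\,P(x)\gamma) = \mathrm{Tr}\,[\,C_1 C_3^{2}\gamma^{3} + C_1^{8} C_3^{2}\gamma^{10} + C_1^{64} C_3^{2}\gamma^{66}$

$\quad + C_1 C_3^{16}\gamma^{17} + C_1^{8} C_3^{16}\gamma^{24} + C_1^{64} C_3^{16}\gamma^{80}$

$\quad + C_1 C_3^{128}\gamma^{129} + C_1^{8} C_3^{128}\gamma^{136} + C_1^{64} C_3^{128}\gamma^{192}$

$\quad + C_2^{256} C_3^{2}\gamma^{258} + C_2^{256} C_3^{16}\gamma^{272} + C_2^{256} C_3^{128}\gamma^{384}$

$\quad + C_2^{4} C_3^{2}\gamma^{6} + C_2^{4} C_3^{16}\gamma^{20} + C_2^{4} C_3^{128}\gamma^{132} + C_2^{32} C_3^{2}\gamma^{34}$

$\quad + C_2^{32} C_3^{16}\gamma^{48} + C_2^{32} C_3^{128}\gamma^{160}\,]$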

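Similarly, one can compute $\Gamma_2(\mathrm{Tr}\,P(x)\gamma^{2^j})$, where
$j \le 8$, and take the sum over all $0 \le j \le 8$ to obtain
$\sum_j \Gamma_2(\mathrm{Tr}\,P(x)\gamma^{2^j}) = 0$. This results from the
choice of the normal basis so that

$\mathrm{Tr}\,\gamma^j = 1$, $j \in J$; $\mathrm{Tr}\,\gamma^{j+k} = 0$, $j \ne k$, $j, k \in J$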

It has been demonstrated that the binary weight of any 
codeword in the RS code above is a multiple of 4. 

III. Structure of the Code 

An examination of the binary version of the RS code reveals nine words
whose cyclic components form a (9,6;4) and a (9,3;7) code over GF(8).
Thus, the symbol weights are 4, 5, 6, 7, 8, and 9. For weights 4, 5, and
6, one has binary weights of the code 16, 20, and 24. When the (9,3;7)
code is nonzero, one has a minimum code weight of 14. But, since the
codewords have weights that are multiples of 4, one has a minimum weight
of 16.

Now consider $C_0$ by itself when $\mathrm{Tr}\,C_0 = 0$. This is a binary
(72,35;16) code. The encoding here has ceased to be systematic since the
condition $\mathrm{Tr}\,C_0 = 0$ is nonsystematic.

The extension parity rule for $C_0$ has been changed to give the
following: For $i = 1$, $\mathrm{Tr}\,P(x)\gamma^i = 1$, and
$\mathrm{Tr}\,P(x)\gamma^i = 0$, where $i \ne 1$, an eighth row of weight 8
is adjoined, e.g., 011111111. The argument invoked above [1] can then be
used to prove that the minimum code distance is 15 and all odd weight
words have weights of the form $4m - 1$.

IV. Decoding Binary 

Assume at first that $\mathrm{Tr}\,C_0 = 0$. Then the parity sums over all
nine binary codes give odd error-pattern information. Thus, with seven
errors or fewer spread out over the nine words, at least two values of
$C_3\gamma^j$ must be correct. A trial of eight other values for $C_3$
will eliminate the cyclic component attached to $x^6$. Since each of the
nine binary codes is now an odd-error-detecting/single-error-correcting
code, the parity information is usable to correct single errors when they
occur. In the case of seven single binary error patterns in seven
different rows, a complete correct decoding emerges. This is the most
complex decoding to perform, as it requires eight decoding trials. If the
seven or fewer error patterns are in a smaller set of the nine binary
codes, then once $C_3$ is determined and single errors are corrected,
there will be at least six correct coefficients of $x$.

Undecodable eight-error patterns occur when fewer than three of the nine
binary codes are error-free. Once the $x^6$ coefficient is obtainable
without too many trials, then there must emerge, after single-error
correction of the erasure codes, at least six correct coefficients of
$x$. One can of course try 512 values of $C_3$ and then correct for single
errors in what remains and finally decode the remaining (9,6;4) code over
GF(8).

Convolutional codes have been the central part of most error-control systems in
deep-space communication for many years. Almost all such applications, however,
have used the restricted class of (n,1), also known as “rate 1/n,” convolutional codes.
The more general class of (n,k) convolutional codes contains many potentially useful
codes, but their algebraic theory is difficult and has proved to be a stumbling block
in the evolution of convolutional coding systems. In this article, the situation is
improved by describing a set of practical algorithms for computing certain basic
things about a convolutional code (among them the degree, the Forney indices, a
minimal generator matrix, and a parity-check matrix), which are usually needed
before a system using the code can be built. The approach is based on the classic
Forney theory for convolutional codes, together with the extended Smith algorithm
for polynomial matrices, which is introduced in this article.


I. Introduction 

In his celebrated paper on the algebraic structure of convolutional
codes, Forney [2] showed that by using the algebra of k x n polynomial
matrices, in particular the invariant-factor theorem (aka the Smith
Form), one can transform an arbitrary generator matrix for an (n, k)
convolutional code C into a noncatastrophic, basic, and ultimately
minimal, generator matrix for C. He also showed how to find a polynomial
inverse for a basic generator matrix for C, and a basic generator matrix
for the dual code $C^{\perp}$. In this article, efficient ways are
discussed to do all these things. The main tool is an algorithm, called
the extended Smith algorithm, which is used to find the invariant factors
of an arbitrary k x n matrix over a Euclidean domain, which bears the
same relationship to the usual invariant factor algorithm as the extended
Euclid’s algorithm bears to the usual Euclid’s algorithm.

A brief review of Euclid’s algorithm (see e.g. [5, Section 1.1] or
[9, Chapter 2]) is helpful. The algorithm’s goal is to take a pair (a, b)
of elements from a Euclidean domain R, and by repeated application of the
division algorithm for R to find the greatest common divisor d of a and
b. The goal of the extended Euclidean algorithm (see e.g. [6, Section
4.5.2] or [8, Section 8.4]) is to take the same pair, and not only find
d, but also find elements s and t of R such that sa + tb = d. The
extended Euclidean algorithm has several important applications in coding
theory. For example, it can be used to compute inverses in finite fields
[9, Example 4.2], decode BCH codes [8, Section 8.5], and to find finite
impulse response (FIR) inverses for noncatastrophic (n, 1) convolutional
generator matrices [1, Section 12.2]. Similarly, the goal of the Smith
algorithm (see e.g. [3, Section 6.2.4], [4, Section 6.3.3], or Smith’s
nineteenth-century original article [10, Section 12.2]) is to take an
arbitrary k x n matrix G (with $k \le n$) over a Euclidean domain R, and
by a sequence of elementary row and column operations, to reduce G to a
k x n diagonal matrix $\Gamma = \mathrm{diag}(\gamma_1, \ldots, \gamma_k)$,
whose diagonal entries are the invariant factors of G, i.e.,
$\gamma_i = \Delta_i/\Delta_{i-1}$, where $\Delta_i$ is the greatest
common divisor of the i x i minors of G. (Here, $\Delta_0 = 1$ is taken by
convention.) Smith’s algorithm is reviewed in Section II.
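
The extended Euclidean algorithm reviewed above can be sketched for the Euclidean domain of most interest here, the ring of polynomials in D over GF(2). In this minimal sketch a polynomial is held as a Python integer whose bit i is the coefficient of $D^i$; that representation and the function names are choices made for this illustration, not something prescribed by the article.

```python
def poly_mul(a, b):
    """Product of two GF(2) polynomials held as bit masks."""
    result = 0
    while b:
        if b & 1:
            result ^= a
        a <<= 1
        b >>= 1
    return result


def poly_divmod(a, b):
    """Division algorithm in GF(2)[D]: return (q, r) with a = q*b + r
    and either r = 0 or deg r < deg b."""
    if b == 0:
        raise ZeroDivisionError("division by the zero polynomial")
    q = 0
    while a.bit_length() >= b.bit_length():
        shift = a.bit_length() - b.bit_length()
        q ^= 1 << shift
        a ^= b << shift
    return q, a


def extended_euclid(a, b):
    """Return (d, s, t) with s*a + t*b = d, where d is a gcd of a and b."""
    r0, r1 = a, b
    s0, s1 = 1, 0
    t0, t1 = 0, 1
    while r1:
        q, r = poly_divmod(r0, r1)
        r0, r1 = r1, r
        s0, s1 = s1, s0 ^ poly_mul(q, s1)  # subtraction is XOR over GF(2)
        t0, t1 = t1, t0 ^ poly_mul(q, t1)
    return r0, s0, t0


# gcd of 1 + D^3 (0b1001) and D + D^2 + D^3 (0b1110)
d, s, t = extended_euclid(0b1001, 0b1110)
print(bin(d), bin(s), bin(t))
```

For this pair the gcd is $1 + D + D^2$, and the returned multipliers satisfy sa + tb = d.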

The goal of the extended Smith algorithm, which is introduced in this
article, is to take the same input, and not only find $\Gamma$, but also
to find a k x k unimodular matrix X, and an n x n unimodular matrix Y,
such that $XGY = \Gamma$. It is worthwhile to note that in the special
case k = 1 and n = 2, the (extended) Smith algorithm reduces to the
(extended) Euclidean algorithm. The extended Smith algorithm is described
in detail in Section III.

Section IV describes how, given an arbitrary generator matrix G for an
(n,k) convolutional code, the extended Smith algorithm can be used to
efficiently compute the things mentioned above (the degree, the Forney
indices, a minimal generator matrix, a polynomial inverse, a parity-check
matrix, etc., for the given code). Throughout this article, all results
are illustrated with an example of a 2 x 4 matrix of polynomials over the
binary field GF(2), which is a generator matrix for a (4,2) convolutional
code over GF(2).


II. Smith’s Algorithm 

In this section, a careful description is given of Smith’s 
algorithm (which is often called the invariant-factor algo- 
rithm), but a formal proof of its correctness is not given. 
For that, refer to [3, Section 6.2.4], [4, Section 6.3.3], or 
[11, Section 12.2]. 

Begin by recalling the definition of a general Euclidean domain
[8, Chapter 2]. It is an integral domain, i.e., a ring that satisfies the
cancellation property, together with a “size” function $|a|$ defined for
every nonzero element $a \in R$. The size function must satisfy
$|a| \le |ab|$ if $b \ne 0$. Furthermore, if a and b are arbitrary, and
$b \ne 0$, a can be “divided” by b in the sense that there exist elements
q (the “quotient”) and r (the “remainder”) such that $a = qb + r$, where
either $r = 0$ or else $|r| < |b|$. For application to the study of
convolutional codes, always take R to be the ring of polynomials over a
field F, where the “size” of a polynomial is its degree. However, there
are many other Euclidean domains [8, Chapter 2], and Smith’s algorithm
applies equally to all of them.

The central part of Smith’s algorithm is the following subalgorithm E,
which takes as input an arbitrary matrix A with entries from a Euclidean
domain R, at least one of which is nonzero, and, via elementary row and
column operations, transforms A into a matrix with the (1,1) entry
nonzero, every other entry in row 1 and column 1 zero, and every other
entry in the matrix divisible by the (1,1) entry.

E1. Move the entry in A of least size to position (1,1).

E2. If there is a nonzero entry in either row 1 or column 1
    that is not divisible by $a_{11}$, use it to reduce
    the size of $a_{11}$, as follows:

    E2a. If $a_{1j}$, the entry in row 1 and column j,
         is not divisible by $a_{11}$, then by the division
         algorithm there exist nonzero elements q and r such
         that $a_{1j} = q a_{11} + r$, with $|r| < |a_{11}|$.
         Thus, if q times column 1 is subtracted from column j,
         the (1,j) entry will be changed to r, which has a
         smaller size than $a_{11}$. If this entry is moved to
         position (1,1), the size of $a_{11}$ will have been
         reduced.

    E2b. If $a_{i1}$, the entry in row i and column 1,
         is not divisible by $a_{11}$, repeat the procedure
         outlined in step E2a, with rows and columns
         interchanged.

E3. Reduce all the entries in row 1 and column 1 (except
    $a_{11}$ itself) to zero, as follows:

    E3a. Since (by step E2) $a_{1j}$ for $j \ge 2$ is divisible
         by $a_{11}$, $a_{1j} = q_j a_{11}$. Thus, by subtracting
         $q_j$ times column 1 from column j, the (1,j) entry
         will be transformed to zero. Repeating this step for
         j = 2, . . . , r, the first row will “zero out.”

    E3b. Repeat step E3a with rows and columns interchanged.

E4. If there is a nonzero entry in A, say $a_{ij}$, which is
    not divisible by $a_{11}$, add column j to column 1.
    (This produces a nonzero entry in column 1 that
    is not divisible by $a_{11}$.) Return to step E2.

E5. Stop. The matrix A now has the desired property.
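
Steps E1 through E5 can be sketched in code for the ring GF(2)[D] used in the examples of this article. The sketch below is only an illustration under stated assumptions: a polynomial is stored as a Python integer whose bit i is the coefficient of $D^i$, the helper names (`deg`, `pmul`, `pdivmod`, `sub_e`, `smith`) are hypothetical, and the offset argument `t` is an implementation convenience that lets the same routine run on the submatrix left after deleting the first t rows and columns. Step E1 is read here as choosing a nonzero entry of least degree.

```python
def deg(p):
    """Degree of a GF(2)[D] polynomial stored as an int (bit i = coefficient of D^i)."""
    return p.bit_length() - 1  # deg(0) is -1 by convention here


def pmul(a, b):
    """Carry-less (GF(2)) product of two polynomials."""
    out = 0
    while b:
        if b & 1:
            out ^= a
        a <<= 1
        b >>= 1
    return out


def pdivmod(a, b):
    """Division algorithm in GF(2)[D]: returns (q, r) with a = q*b + r, deg r < deg b."""
    q = 0
    while deg(a) >= deg(b):
        shift = deg(a) - deg(b)
        q ^= 1 << shift
        a ^= b << shift
    return q, a


def add_col(A, src, dst, q):
    """Add q times column src to column dst (addition equals subtraction over GF(2))."""
    for row in A:
        row[dst] ^= pmul(q, row[src])


def add_row(A, src, dst, q):
    """Add q times row src to row dst."""
    A[dst] = [x ^ pmul(q, y) for x, y in zip(A[dst], A[src])]


def swap_cols(A, i, j):
    for row in A:
        row[i], row[j] = row[j], row[i]


def sub_e(A, t=0):
    """Run subalgorithm E in place on the submatrix of A with rows and columns >= t."""
    rows, cols = len(A), len(A[0])
    # E1: move a nonzero entry of least degree to the pivot position (t, t).
    i, j = min(((i, j) for i in range(t, rows) for j in range(t, cols) if A[i][j]),
               key=lambda ij: deg(A[ij[0]][ij[1]]))
    A[t], A[i] = A[i], A[t]
    swap_cols(A, t, j)
    while True:
        pivot = A[t][t]
        # E2a: a pivot-row entry not divisible by the pivot yields a smaller pivot.
        j = next((j for j in range(t + 1, cols) if pdivmod(A[t][j], pivot)[1]), None)
        if j is not None:
            add_col(A, t, j, pdivmod(A[t][j], pivot)[0])
            swap_cols(A, t, j)
            continue
        # E2b: the same with rows and columns interchanged.
        i = next((i for i in range(t + 1, rows) if pdivmod(A[i][t], pivot)[1]), None)
        if i is not None:
            add_row(A, t, i, pdivmod(A[i][t], pivot)[0])
            A[t], A[i] = A[i], A[t]
            continue
        # E3: clear the rest of the pivot row, then the rest of the pivot column.
        for j in range(t + 1, cols):
            add_col(A, t, j, pdivmod(A[t][j], pivot)[0])
        for i in range(t + 1, rows):
            add_row(A, t, i, pdivmod(A[i][t], pivot)[0])
        # E4: an entry not divisible by the pivot is pulled into the pivot column.
        bad = next((j for i in range(t + 1, rows) for j in range(t + 1, cols)
                    if pdivmod(A[i][j], pivot)[1]), None)
        if bad is None:
            return A  # E5
        add_col(A, bad, t, 1)


def smith(G):
    """Apply sub_e to successive trailing submatrices; returns the diagonal form."""
    A = [row[:] for row in G]
    for t in range(min(len(A), len(A[0]))):
        if not any(A[i][j] for i in range(t, len(A)) for j in range(t, len(A[0]))):
            break  # the remaining submatrix is all zero
        sub_e(A, t)
    return A


# 7 = 1+D+D^2, 5 = 1+D^2, 3 = 1+D, 2 = D, 4 = D^2
G = [[1, 7, 5, 3], [2, 7, 4, 1]]
print(smith(G))  # [[1, 0, 0, 0], [0, 7, 0, 0]]
```

The integer encoding keeps addition as a single XOR, so "subtract q times column 1" and "add q times column 1" coincide; over a field of odd characteristic the helpers would need genuine subtraction.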
In the Smith algorithm itself, one applies subalgorithm E
successively to the original matrix G, then to
the matrix obtained from G by deleting row 1 and column 1,
then by deleting row 2 and column 2, etc., until
either an all-zero matrix is encountered, or until all rows
and columns have been processed. The result is a k × n
diagonal matrix $\Gamma = \mathrm{diag}(\gamma_1, \ldots, \gamma_k)$,
whose diagonal entries are the invariant factors of G. The following
example illustrates the Smith algorithm, when the underlying
Euclidean domain is the ring of polynomials over the field
GF(2), in which the “size” of an element is its degree.

Example 1. Consider the following 2 × 4 matrix with
entries that are polynomials over GF(2):

$$G = \begin{pmatrix} 1 & 1+D+D^2 & 1+D^2 & 1+D \\ D & 1+D+D^2 & D^2 & 1 \end{pmatrix}$$

Since the lowest degree term already appears in the (1,1)
position, skip step E1. Since the (1,1) entry is 1, every
nonzero entry in row 1 and column 1 is divisible by the
(1,1) entry, so skip step E2 as well. Executing step E3:
“zero out” the first row by adding $1+D+D^2$ times column 1
to column 2, $1+D^2$ times column 1 to column 3,
and $1+D$ times column 1 to column 4, thereby obtaining
successively

$$G_1 = \begin{pmatrix} 1 & 0 & 1+D^2 & 1+D \\ D & 1+D^3 & D^2 & 1 \end{pmatrix}$$

$$G_2 = \begin{pmatrix} 1 & 0 & 0 & 1+D \\ D & 1+D^3 & D+D^2+D^3 & 1 \end{pmatrix}$$

$$G_3 = \begin{pmatrix} 1 & 0 & 0 & 0 \\ D & 1+D^3 & D+D^2+D^3 & 1+D+D^2 \end{pmatrix}$$

To zero the entry in position (2,1), add D times row 1 to
row 2, thereby obtaining

$$G_4 = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1+D^3 & D+D^2+D^3 & 1+D+D^2 \end{pmatrix}$$

This completes the operation of subalgorithm E on the
original matrix G. Now apply subalgorithm E to the 1 × 3
matrix obtained by deleting row 1 and column 1 from $G_4$.
The lowest degree term in $G_4$ appears in position (2,4), so
by interchanging columns 2 and 4, it is moved to position
(2,2):

$$G_5 = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1+D+D^2 & D+D^2+D^3 & 1+D^3 \end{pmatrix}$$

At this point, all of the remaining entries in row 2 are
divisible by the (2,2) entry: $D+D^2+D^3 = D(1+D+D^2)$,
and $1+D^3 = (1+D)(1+D+D^2)$. Thus by adding D
times column 2 to column 3, and then $1+D$ times column 2
to column 4, compute successively

$$G_6 = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1+D+D^2 & 0 & 1+D^3 \end{pmatrix}$$

$$G_7 = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1+D+D^2 & 0 & 0 \end{pmatrix}$$

This completes the operation of the Smith algorithm on
G. The matrix $G_7$ is the invariant-factor form for G; in
particular, the invariant factors of G are $\gamma_1 = 1$ and
$\gamma_2 = 1+D+D^2$. ■

III. The Extended Smith Algorithm 

The previous section showed, in computational detail,
how the Smith algorithm works. Beginning with the matrix
$G_0 = G$, it produces a sequence of k × n matrices $G_i$,
where $G_{i+1}$ is derived from $G_i$ by either an elementary
row operation or an elementary column operation. This is
represented algebraically as

$$G_{i+1} = E_{i+1} G_i F_{i+1} \tag{1}$$

where $E_{i+1}$ and $F_{i+1}$ are k × k and n × n elementary
matrices, respectively. If $G_{i+1}$ is obtained from $G_i$ via a row
operation, then $E_{i+1}$ is a nontrivial k × k elementary matrix,
but $F_{i+1} = I_n$. If $G_{i+1}$ is obtained from $G_i$ via a
column operation, then $F_{i+1}$ is a nontrivial n × n elementary
matrix, but $E_{i+1} = I_k$. After a finite number N of
steps, $G_N = \Gamma$.

The extended Smith algorithm builds on the Smith algorithm.
Whereas the Smith algorithm works only with
the sequence $G_0, G_1, \ldots, G_N$, the extended Smith algorithm
also works with a sequence of unimodular k × k
matrices $X_0, \ldots, X_N$, and a sequence of unimodular n × n
matrices $Y_0, \ldots, Y_N$. The sequences $(X_i)$ and $(Y_i)$ are
initialized as $X_0 = I_k$ and $Y_0 = I_n$, and updated via the rule
[cf. Eq. (1)]

$$X_{i+1} = E_{i+1} X_i \tag{2}$$

$$Y_{i+1} = Y_i F_{i+1} \tag{3}$$


The following simple lemma is the key to the extended
Smith algorithm.

Lemma.

$$X_i G Y_i = G_i \quad \text{for } i = 0, 1, \ldots, N \tag{4}$$

Proof. Since $X_0 = I_k$ and $Y_0 = I_n$, Eq. (4) holds for
i = 0. Assuming now that Eq. (4) holds for a given value
of i, multiply both sides of Eq. (4) on the left by $E_{i+1}$ and
on the right by $F_{i+1}$:

$$E_{i+1} X_i G Y_i F_{i+1} = E_{i+1} G_i F_{i+1} \tag{5}$$

By Eq. (1), the right side of Eq. (5) is equal to $G_{i+1}$. Thus

$$(E_{i+1} X_i)\, G\, (Y_i F_{i+1}) = G_{i+1} \tag{6}$$

But $E_{i+1} X_i = X_{i+1}$ by Eq. (2) and $Y_i F_{i+1} = Y_{i+1}$ by
Eq. (3), so Eq. (6) becomes simply $X_{i+1} G Y_{i+1} = G_{i+1}$,
which proves Eq. (4) by induction. ■

If Eq. (4) is specialized with i = N,

$$X_N G Y_N = \Gamma \tag{7}$$

which is the desired extended Smith diagonalization of G.
A convenient way to implement the extended Smith algorithm
is to extend the original matrix G to dimensions
(n + k) × (n + k) as follows:

$$G' = \begin{pmatrix} G & I_k \\ I_n & 0_{n \times k} \end{pmatrix} \tag{8}$$

where in Eq. (8) $0_{n \times k}$ is an n × k matrix of zeros. Then
if the sequence of elementary row and column operations
generated by the Smith algorithm applied to G is performed
on the extended matrix $G'$, after i iterations, the
resulting matrix $G'_i$ has the form

$$G'_i = \begin{pmatrix} G_i & X_i \\ Y_i & 0_{n \times k} \end{pmatrix} \tag{9}$$

Thus, after N steps, the matrices $X_N$ and $Y_N$ in Eq. (7)
appear as the upper-right and lower-left blocks of $G'_N$,
respectively. All this is illustrated by extending the example
from Section II.


Example 2. Begin as in Example 1, with the following 2 × 4 matrix over GF(2)[D]:

$$G = \begin{pmatrix} 1 & 1+D+D^2 & 1+D^2 & 1+D \\ D & 1+D+D^2 & D^2 & 1 \end{pmatrix}$$

Then the corresponding matrix $G'$ [cf. Eq. (8)] is

$$G' = G'_0 = \begin{pmatrix}
1 & 1+D+D^2 & 1+D^2 & 1+D & 1 & 0 \\
D & 1+D+D^2 & D^2 & 1 & 0 & 1 \\
1 & 0 & 0 & 0 & 0 & 0 \\
0 & 1 & 0 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 & 0
\end{pmatrix}$$

In Example 1, $G_1$, $G_2$, and $G_3$ are obtained from $G_0$ by successively adding $1+D+D^2$ times column 1 to column 2,
$1+D^2$ times column 1 to column 3, and $1+D$ times column 1 to column 4. If the same sequence of operations is
performed on $G'_0$, one obtains successively

$$G'_1 = \begin{pmatrix}
1 & 0 & 1+D^2 & 1+D & 1 & 0 \\
D & 1+D^3 & D^2 & 1 & 0 & 1 \\
1 & 1+D+D^2 & 0 & 0 & 0 & 0 \\
0 & 1 & 0 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 & 0
\end{pmatrix}$$

$$G'_2 = \begin{pmatrix}
1 & 0 & 0 & 1+D & 1 & 0 \\
D & 1+D^3 & D+D^2+D^3 & 1 & 0 & 1 \\
1 & 1+D+D^2 & 1+D^2 & 0 & 0 & 0 \\
0 & 1 & 0 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 & 0
\end{pmatrix}$$

$$G'_3 = \begin{pmatrix}
1 & 0 & 0 & 0 & 1 & 0 \\
D & 1+D^3 & D+D^2+D^3 & 1+D+D^2 & 0 & 1 \\
1 & 1+D+D^2 & 1+D^2 & 1+D & 0 & 0 \\
0 & 1 & 0 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 & 0
\end{pmatrix}$$


Next, adding D times row 1 to row 2, as was done to obtain $G_4$ from $G_3$, obtain

$$G'_4 = \begin{pmatrix}
1 & 0 & 0 & 0 & 1 & 0 \\
0 & 1+D^3 & D+D^2+D^3 & 1+D+D^2 & D & 1 \\
1 & 1+D+D^2 & 1+D^2 & 1+D & 0 & 0 \\
0 & 1 & 0 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 & 0
\end{pmatrix}$$


Interchanging columns 2 and 4, obtain

$$G'_5 = \begin{pmatrix}
1 & 0 & 0 & 0 & 1 & 0 \\
0 & 1+D+D^2 & D+D^2+D^3 & 1+D^3 & D & 1 \\
1 & 1+D & 1+D^2 & 1+D+D^2 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 & 0 \\
0 & 1 & 0 & 0 & 0 & 0
\end{pmatrix}$$


Finally, adding D times column 2 to column 3, and $1+D$ times column 2 to column 4, one computes successively

$$G'_6 = \begin{pmatrix}
1 & 0 & 0 & 0 & 1 & 0 \\
0 & 1+D+D^2 & 0 & 1+D^3 & D & 1 \\
1 & 1+D & 1+D & 1+D+D^2 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 & 0 \\
0 & 1 & D & 0 & 0 & 0
\end{pmatrix}$$

and

$$G'_7 = \begin{pmatrix}
1 & 0 & 0 & 0 & 1 & 0 \\
0 & 1+D+D^2 & 0 & 0 & D & 1 \\
1 & 1+D & 1+D & D & 0 & 0 \\
0 & 0 & 0 & 1 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 & 0 \\
0 & 1 & D & 1+D & 0 & 0
\end{pmatrix}$$

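Thus the extended Smith decomposition of the original matrix G is given by [cf. Eq. (7)]

$$\begin{pmatrix} 1 & 0 \\ D & 1 \end{pmatrix}
\, G \,
\begin{pmatrix}
1 & 1+D & 1+D & D \\
0 & 0 & 0 & 1 \\
0 & 0 & 1 & 0 \\
0 & 1 & D & 1+D
\end{pmatrix}
= \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1+D+D^2 & 0 & 0 \end{pmatrix}$$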

In Section V, the decomposition of Example 2 will be used in the analysis of the (4,2) convolutional code generated by 
G, as an illustration of the general results to be expounded upon in Section IV. 
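
The bookkeeping of Example 2 can be replayed mechanically. The following sketch builds the extended matrix of Eq. (8), applies the same seven operations, and reads $X$, $Y$, and $\Gamma$ off the blocks of the result. It assumes the same integer encoding of GF(2)[D] polynomials as before (bit i is the coefficient of $D^i$); the function names are hypothetical, and row and column indices are 0-based, so "column 1" in the text is index 0.

```python
def pmul(a, b):
    """Carry-less product of two GF(2)[D] polynomials stored as ints."""
    out = 0
    while b:
        if b & 1:
            out ^= a
        a <<= 1
        b >>= 1
    return out


def matmul(A, B):
    """Matrix product over GF(2)[D]."""
    out = [[0] * len(B[0]) for _ in A]
    for i, row in enumerate(A):
        for j in range(len(B[0])):
            for l, a in enumerate(row):
                out[i][j] ^= pmul(a, B[l][j])
    return out


def augment(G):
    """Build the (n+k) x (n+k) extended matrix [[G, I_k], [I_n, 0]] of Eq. (8)."""
    k, n = len(G), len(G[0])
    top = [list(G[i]) + [int(i == j) for j in range(k)] for i in range(k)]
    bottom = [[int(i == j) for j in range(n)] + [0] * k for i in range(n)]
    return top + bottom


def add_col(M, src, dst, q):
    for row in M:
        row[dst] ^= pmul(q, row[src])


def add_row(M, src, dst, q):
    M[dst] = [x ^ pmul(q, y) for x, y in zip(M[dst], M[src])]


def swap_cols(M, i, j):
    for row in M:
        row[i], row[j] = row[j], row[i]


# 7 = 1+D+D^2, 5 = 1+D^2, 3 = 1+D, 2 = D, 4 = D^2
G = [[1, 7, 5, 3], [2, 7, 4, 1]]
k, n = 2, 4
M = augment(G)
add_col(M, 0, 1, 7)   # G'_1: add (1+D+D^2) * column 1 to column 2
add_col(M, 0, 2, 5)   # G'_2: add (1+D^2) * column 1 to column 3
add_col(M, 0, 3, 3)   # G'_3: add (1+D) * column 1 to column 4
add_row(M, 0, 1, 2)   # G'_4: add D * row 1 to row 2
swap_cols(M, 1, 3)    # G'_5: interchange columns 2 and 4
add_col(M, 1, 2, 2)   # G'_6: add D * column 2 to column 3
add_col(M, 1, 3, 3)   # G'_7: add (1+D) * column 2 to column 4

Gamma = [row[:n] for row in M[:k]]  # upper-left block
X = [row[n:] for row in M[:k]]      # upper-right block
Y = [row[:n] for row in M[k:]]      # lower-left block
print(X)   # [[1, 0], [2, 1]]
print(Y)   # [[1, 3, 3, 2], [0, 0, 0, 1], [0, 0, 1, 0], [0, 1, 2, 3]]
print(matmul(matmul(X, G), Y) == Gamma)  # True
```

Because column operations touch only the first n columns and row operations only the first k rows, the lower-right zero block is never disturbed, which is what makes the single extended matrix a convenient record of both $X_i$ and $Y_i$.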


IV. Application to the Analysis of 
Convolutional Codes 

In this section, given an arbitrary generator matrix G
for an (n, k) convolutional code C, one can efficiently find,
among other things: a basic generator matrix, say $G_{\text{basic}}$,
for C; a polynomial inverse for $G_{\text{basic}}$; a minimal generator
matrix for C; and a basic generator matrix for the dual
code $C^{\perp}$. The central tool needed to do all this is the
extended Smith algorithm introduced in Section III.

Assume then that G is a k × n generator matrix for
an (n, k) convolutional code over the field F, i.e., a k × n
matrix of rank k over the field F(D) of rational functions
over F. By multiplying the ith row of G by the least common
multiple of the denominators in that row, one easily
obtains an equivalent generator matrix, all of whose entries
are polynomials over F, i.e., elements of F[D]. Since
F[D] is a Euclidean domain, one may apply the extended
Smith algorithm described in Section III, thereby obtaining
a decomposition of the form of Eq. (7). In what follows, the
matrices $X_N$ and $Y_N$ produced by the extended Smith algorithm
will be denoted simply by X and Y.

The matrices X, Y, and $\Gamma$ contain much valuable
information about the code C and the generator matrix G.
To extract this information, however, first define several
useful pieces of these matrices, called $\Gamma_k$, $\tilde{\Gamma}_k$, K, and H
(in what follows, r = n − k):

$$\Gamma_k = \text{leftmost } k \text{ columns of } \Gamma = \mathrm{diag}(\gamma_1, \ldots, \gamma_k) \quad (\text{dimensions: } k \times k) \tag{10}$$

$$\tilde{\Gamma}_k = \gamma_k \Gamma_k^{-1} = \mathrm{diag}(\gamma_k/\gamma_1, \ldots, \gamma_k/\gamma_k) \quad (\text{dimensions: } k \times k) \tag{11}$$

$$K = \text{leftmost } k \text{ columns of } Y \quad (\text{dimensions: } n \times k) \tag{12}$$

$$H = \text{rightmost } r \text{ columns of } Y \quad (\text{dimensions: } n \times r) \tag{13}$$

The following theorem describes the useful outputs of
the extended Smith algorithm when applied to G. (For
terminology not fully explained here, refer to Forney [2].)

Theorem 1. With the matrices $\Gamma_k$, $\tilde{\Gamma}_k$, K, and H
defined as in Eqs. (10)-(13), one has the following:

(a) A basic generator matrix for C: $G_{\text{basic}} = \Gamma_k^{-1} X G$.
    (That is, $G_{\text{basic}}$ is obtained by dividing the ith row of
    XG by the invariant factor $\gamma_i$, for i = 1, . . . , k.)

(b) A polynomial inverse for $G_{\text{basic}}$: K.

(c) A polynomial pseudo-inverse for G, with factor $\gamma_k$:
    $K \tilde{\Gamma}_k X$. (In particular, if G is already basic, i.e., if
    $\Gamma_k = I_k$, then KX is a polynomial inverse for G.)

(d) A basic generator matrix for $C^{\perp}$, i.e., a parity-check
    matrix for C: $H^T$.
Proof. From Eq. (7) with $X = X_N$ and $Y = Y_N$, one has

$$XG = \Gamma B \tag{14}$$

where $B = Y^{-1}$. Now B is an n × n unimodular polynomial
matrix. Let U be the k × n matrix consisting of the first
k rows of B, and L be the r × n matrix consisting of the
last r rows of B. Then, since $\Gamma = (\Gamma_k \;\; 0_{k \times r})$, where $0_{k \times r}$
is a k × r all-zeros matrix, it follows that

$$\Gamma B = \begin{pmatrix} \Gamma_k & 0_{k \times r} \end{pmatrix} \begin{pmatrix} U \\ L \end{pmatrix} = \Gamma_k U \tag{15}$$

so that, combining Eqs. (14) and (15),

$$U = \Gamma_k^{-1} X G \tag{16}$$

From Eq. (16), U is row equivalent to G. Furthermore,
since $U = (I_k \;\; 0_{k \times r}) B$, and B is unimodular, it follows
that the invariant factors of U are all equal to 1, and so U is
a basic generator matrix for C. This proves Theorem 1(a).

To prove Theorem 1(b), use the fact that Y and B are
inverse matrices, so that

$$\begin{pmatrix} U \\ L \end{pmatrix} \begin{pmatrix} K & H \end{pmatrix}
= \begin{pmatrix} UK & UH \\ LK & LH \end{pmatrix}
= I_n
= \begin{pmatrix} I_k & 0_{k \times r} \\ 0_{r \times k} & I_r \end{pmatrix} \tag{17}$$

It follows from Eq. (17) that $UK = I_k$, so that K is a
polynomial inverse for the basic generator matrix U, which
proves Theorem 1(b).

To prove Theorem 1(c), suppose that J is an n × k
polynomial pseudo-inverse for G with (polynomial) factor
$\psi(D)$, i.e., that $GJ = \psi I_k$. Then

$$X G Y Y^{-1} J X^{-1} = X G J X^{-1} = \psi I_k \tag{18}$$

But from Eq. (7), $XGY = \Gamma = (\Gamma_k \;\; 0_{k \times r})$. Thus, if
the matrix $Y^{-1} J X^{-1}$ is denoted by $J'$, $J'$ is a polynomial
matrix (since X and Y are unimodular) and Eq. (18)
becomes

$$\begin{pmatrix} \Gamma_k & 0 \end{pmatrix} J' = \psi I_k \tag{19}$$

If now

$$J' = \begin{pmatrix} J'_0 \\ J'_1 \end{pmatrix}$$

where $J'_0$ is k × k and $J'_1$ is r × k,
it follows from Eq. (19) that $J'_0 = \psi \Gamma_k^{-1}$. But since $J'$
is a polynomial matrix, and $\Gamma_k^{-1} = \mathrm{diag}(\gamma_1^{-1}, \ldots, \gamma_k^{-1})$, it
follows that $\psi$ must be a multiple of $\gamma_k$. Conversely, if
$\psi = \gamma_k$, so that $J'_0 = \gamma_k \Gamma_k^{-1} = \tilde{\Gamma}_k$ and $J'_1 = 0_{r \times k}$, then

$$J' = \begin{pmatrix} \tilde{\Gamma}_k \\ 0_{r \times k} \end{pmatrix}$$

Thus, $J = Y J' X = K \tilde{\Gamma}_k X$ is a polynomial
pseudo-inverse with factor $\gamma_k$, which proves Theorem 1(c).

To prove Theorem 1(d), one has to show that the set
of codewords in the code generated by G, i.e., the set of
n-dimensional F(D)-vectors y of the form y = xG, is identical
to the set of vectors y such that yH = 0. Thus, let
$\mathcal{Y}_1 = \{y : y = xG\}$ and $\mathcal{Y}_2 = \{y : yH = 0\}$. It will be
shown that $\mathcal{Y}_1 = \mathcal{Y}_2$.

Since, as in Theorem 1(a), the matrix U, defined to be
the first k rows of $B = Y^{-1}$, is (a basic generator matrix)
equivalent to G, it follows that $\mathcal{Y}_1 = \{y : y = xU\}$. Thus,
if $y \in \mathcal{Y}_1$, y = xU for some x and so $yH = (xU)H =
x(UH) = 0$, since by Eq. (17), $UH = 0_{k \times r}$. Thus, $\mathcal{Y}_1 \subseteq \mathcal{Y}_2$.
On the other hand, if y is arbitrary, then since $B^{-1} = Y$,

$$y = y I_n = y(YB) = y\left( \begin{pmatrix} K & H \end{pmatrix} \begin{pmatrix} U \\ L \end{pmatrix} \right) = y(KU + HL) = (yK)U + (yH)L \tag{20}$$

Now suppose $y \in \mathcal{Y}_2$, i.e., yH = 0. Then from Eq. (20)
y = (yK)U = xU, where x = yK, and so $y \in \mathcal{Y}_1$. Thus,
$\mathcal{Y}_2 \subseteq \mathcal{Y}_1$, which completes the proof of Theorem 1(d). ■

In the next section these results will be briefly illustrated
with a simple example.


V. An Example Convolutional Code

To illustrate the results of this article, consider the following
generator matrix G for a binary (4, 2) convolutional
code:

$$G = \begin{pmatrix} 1 & 1+D+D^2 & 1+D^2 & 1+D \\ D & 1+D+D^2 & D^2 & 1 \end{pmatrix}$$

In Section III the extended invariant-factor decomposition
of G was found to be $XGY = \Gamma$, where

$$\Gamma = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1+D+D^2 & 0 & 0 \end{pmatrix}$$

$$X = \begin{pmatrix} 1 & 0 \\ D & 1 \end{pmatrix}$$

$$Y = \begin{pmatrix}
1 & 1+D & 1+D & D \\
0 & 0 & 0 & 1 \\
0 & 0 & 1 & 0 \\
0 & 1 & D & 1+D
\end{pmatrix}$$
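Thus, the matrices defined in Eqs. (10)-(13) are as follows:

$$\Gamma_k = \begin{pmatrix} 1 & 0 \\ 0 & 1+D+D^2 \end{pmatrix}$$

$$\tilde{\Gamma}_k = \begin{pmatrix} 1+D+D^2 & 0 \\ 0 & 1 \end{pmatrix}$$

$$K = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 1+D & 0 & 0 & 1 \end{pmatrix}^T$$

$$H = \begin{pmatrix} 1+D & 0 & 1 & D \\ D & 1 & 0 & 1+D \end{pmatrix}^T$$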


Using the prescriptions in Theorem 1, one quickly obtains
the following:

(a) A basic generator matrix for C:

$$G_{\text{basic}} = \Gamma_k^{-1} X G = \begin{pmatrix} 1 & 1+D+D^2 & 1+D^2 & 1+D \\ 0 & 1+D & D & 1 \end{pmatrix}$$

(b) A polynomial inverse for $G_{\text{basic}}$:

$$K = \begin{pmatrix} 1 & 1+D \\ 0 & 0 \\ 0 & 0 \\ 0 & 1 \end{pmatrix}$$

(c) A polynomial pseudo-inverse for G, with factor $\gamma_2 = 1+D+D^2$:

$$K \tilde{\Gamma}_k X = \begin{pmatrix} 1 & 1+D \\ 0 & 0 \\ 0 & 0 \\ D & 1 \end{pmatrix}$$

(d) A (basic) generator matrix for $C^{\perp}$:

$$H^T = \begin{pmatrix} 1+D & 0 & 1 & D \\ D & 1 & 0 & 1+D \end{pmatrix}$$

Finally, note that in this case neither of the basic generator
matrices $G_{\text{basic}}$ or $H^T$ is minimal. To minimize them,
follow the simple algorithm originally described in [2], or
perhaps more lucidly in Kailath [4, Section 6.3.2] (where
minimal matrices are, however, called row-reduced). The
idea is to use elementary row operations on G to reduce
the degree of the highest degree terms in some row, as
long as this is possible. For example, to minimize $G_{\text{basic}}$,
multiply the second row by D and add it to the first row,
obtaining

$$G' = \begin{pmatrix} 1 & 1 & 1 & 1 \\ 0 & 1+D & D & 1 \end{pmatrix}$$


Since elementary row operations cannot reduce the row degrees
further, the matrix $G'$ is a minimal generator matrix
for the code C. (Indeed, at this point, one can recognize C
as the $d_{\text{free}} = 4$ partial-unit-memory (4, 2) code first found
by Lauer [7, Table 1].) To find a polynomial inverse for
$G'$, note that $G' = T G_{\text{basic}}$, where T is a unimodular k × k
polynomial matrix, and so $K' = K T^{-1}$ is a polynomial
inverse for $G'$. In this example, simply multiply the first
column of K by D and subtract it from the second column,
thereby obtaining the following polynomial inverse for the
minimal generator matrix $G'$:

$$K' = \begin{pmatrix} 1 & 1 \\ 0 & 0 \\ 0 & 0 \\ 0 & 1 \end{pmatrix}$$

To minimize $H^T$, simply add row 2 to row 1, thereby
obtaining the following minimal generator matrix for the
dual code:

$$\begin{pmatrix} 1 & 1 & 1 & 1 \\ D & 1 & 0 & 1+D \end{pmatrix}$$

In this case, the dual code is isomorphic to the original
code.
The Forney indices of an (n, k) convolutional code are
the degrees of the rows of a minimal generator matrix, and
the degree of the code is the sum of the Forney indices.
In this example, the degree of both C and $C^{\perp}$ is 1, and
the Forney indices of both codes are (0, 1). It is a general
theorem that $\deg C = \deg C^{\perp}$ (see [2, Theorem 7]), but the
equality of the Forney indices in this case is more or less
accidental.
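
The matrices quoted in this section can be checked mechanically against the claims of Theorem 1. The sketch below does so for the worked example only; it assumes GF(2)[D] polynomials are stored as Python integers (bit i is the coefficient of $D^i$), and all function and variable names are hypothetical.

```python
def pmul(a, b):
    """Carry-less product of two GF(2)[D] polynomials stored as ints."""
    out = 0
    while b:
        if b & 1:
            out ^= a
        a <<= 1
        b >>= 1
    return out


def matmul(A, B):
    """Matrix product over GF(2)[D]."""
    out = [[0] * len(B[0]) for _ in A]
    for i, row in enumerate(A):
        for j in range(len(B[0])):
            for l, a in enumerate(row):
                out[i][j] ^= pmul(a, B[l][j])
    return out


def row_degrees(M):
    """Degree of each row: the largest degree among its entries."""
    return tuple(max(x.bit_length() - 1 for x in row) for row in M)


# 7 = 1+D+D^2, 5 = 1+D^2, 3 = 1+D, 2 = D, 4 = D^2
G = [[1, 7, 5, 3], [2, 7, 4, 1]]
X = [[1, 0], [2, 1]]
Gamma_tilde = [[7, 0], [0, 1]]
K = [[1, 3], [0, 0], [0, 0], [0, 1]]        # leftmost k columns of Y
H = [[3, 2], [0, 1], [1, 0], [2, 3]]        # rightmost r columns of Y
G_basic = [[1, 7, 5, 3], [0, 3, 2, 1]]
G_min = [[1, 1, 1, 1], [0, 3, 2, 1]]        # the minimal matrix G'
K_min = [[1, 1], [0, 0], [0, 0], [0, 1]]    # the inverse K'
H_min_T = [[1, 1, 1, 1], [2, 1, 0, 3]]      # minimal generator of the dual code
I2 = [[1, 0], [0, 1]]

print(matmul(G_basic, K) == I2)                   # Theorem 1(b): True
J = matmul(matmul(K, Gamma_tilde), X)             # pseudo-inverse of Theorem 1(c)
print(matmul(G, J) == [[7, 0], [0, 7]])           # G J = gamma_2 I: True
print(matmul(G, H) == [[0, 0], [0, 0]])           # Theorem 1(d): True
print(matmul(G_min, K_min) == I2)                 # True
print(row_degrees(G_min), row_degrees(H_min_T))   # (0, 1) (0, 1)
```

The last line reproduces the Forney indices (0, 1) for both the code and its dual, so both degrees equal 1 as stated above.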
The application of two wavelet transforms to image compression is discussed.
It is noted that the Haar transform, with proper bit allocation, has performance
that is visually superior to an algorithm based on a Daubechies filter and to the
discrete-cosine-transform-based Joint Photographic Experts Group (JPEG) algorithm
at compression ratios exceeding 20:1. In terms of the root-mean-square error,
the performance of the Haar transform method is basically comparable to that of
the JPEG algorithm. The implementation of the Haar transform can be achieved
in integer arithmetic, making it very suitable for applications requiring real-time
performance.


I. Introduction 

In an earlier article [1], the author reported on his work 
on the application of a wavelet transform to image coding. 
It was noted that the images processed by this method 
did not suffer from the blockiness which is typical of al- 
gorithms based on the discrete cosine transform (DCT) at 
compression ratios exceeding 20 to 1; however, the edges 
of the objects in an image were blurred and created an 
unsatisfactory visual impression. Two approaches were 
taken to overcome this problem. The wavelet transform 
used in [1] was essentially one dimensional in nature, and 
it was applied along rows and columns of the image. In 
the first approach, it was surmised that the reason for 
the blurriness of the edges was that the application of the 
transform along rows and columns did not properly take 
advantage of the proximity of the pixels. Several modi- 
fied versions of this transform reflecting the contiguity of 
the pixels were tested. Improvements in the edges and 
the general quality of the processed images were observed 
for certain modified transforms. The second approach was 
based on a two-dimensional Haar transform. The edges 
and the general quality of the processed images were in 


general superior to those of the first method and to those 
of the Joint Photographic Experts Group (JPEG) DCT- 
based algorithm at compression ratios exceeding 20 to 1. 
In terms of the root-mean-square error (RMSE), the per- 
formance of this method is essentially comparable to that 
of the JPEG DCT-based algorithm. This is remarkable, 
since the quantization and bit-allocation algorithms used 
in the tests reported here were of a much cruder nature 
than those utilized in the DCT-based approach. While 
there is some blockiness in images processed with the Haar 
transform method, it is not nearly as severe as in images 
processed with the DCT-based algorithm, and the edges 
appear as sharp as those obtained by the latter method. 

In the application of wavelet transforms, one assumes 
that the wavelet transform coefficients are encoded by an 
entropy encoder. This method is analogous to algorithms 
based on other transforms such as DCT. An advantage of 
the Haar transform approach is that both the algorithm 
and its inversion can be implemented in integer arithmetic 
and are very suitable for applications where real-time per- 
formance is required. 
II. A Daubechies Filter

The work in [1] was based on the filter T defined by
the matrix

$$T = \begin{pmatrix}
\alpha_0 & \alpha_1 & \alpha_2 & \alpha_3 & 0 & 0 & \cdots & 0 & 0 \\
\alpha_3 & -\alpha_2 & \alpha_1 & -\alpha_0 & 0 & 0 & \cdots & 0 & 0 \\
0 & 0 & \alpha_0 & \alpha_1 & \alpha_2 & \alpha_3 & \cdots & 0 & 0 \\
0 & 0 & \alpha_3 & -\alpha_2 & \alpha_1 & -\alpha_0 & \cdots & 0 & 0 \\
\vdots & & & & & & \ddots & & \vdots \\
\alpha_2 & \alpha_3 & 0 & 0 & 0 & 0 & \cdots & \alpha_0 & \alpha_1 \\
\alpha_1 & -\alpha_0 & 0 & 0 & 0 & 0 & \cdots & \alpha_3 & -\alpha_2
\end{pmatrix}$$

where

$$\alpha_0 = \frac{1+\sqrt{3}}{4\sqrt{2}}, \quad
\alpha_1 = \frac{3+\sqrt{3}}{4\sqrt{2}}, \quad
\alpha_2 = \frac{3-\sqrt{3}}{4\sqrt{2}}, \quad
\alpha_3 = \frac{1-\sqrt{3}}{4\sqrt{2}}$$

This filter, the discrete version of one of Daubechies’ compactly
supported wavelets [2], is one dimensional and was
applied in the work reported in [1] along rows and columns
of an image. Instead, it is possible to reorder the pixels
differently to take advantage of their proximity and then
apply the filter. There is no unique way of reordering or
reconfiguring the pixels. A number of experiments were
carried out with widely different results. Certain configurations
led to improvement of the edges, while others
resulted in significant deterioration of the processed image.
The author does not know of any theoretical procedure
for obtaining the optimal configuration. The experiments
did, however, yield some insight into what a
desirable configuration may look like. A method which
produced improvements in the processed image is a two-step
application of the filter T. The first step is described
in Fig. 1. Starting with an even-numbered row,
the weighted average of the pixels with coordinates
(2m, k), (2m + 1, k), (2m, k + 1), and (2m + 1, k + 1) is
computed by using the weights $\alpha_0$, $\alpha_1$, $\alpha_2$, and $\alpha_3$. This
weighted average is the transformed value at the pixel
with coordinates (2m, k). To compute the transformed
value at (2m + 1, k), the weighted average of the pixels
(2m, k), (2m + 1, k), (2m, k + 1), and (2m + 1, k + 1) with
weights $\alpha_3$, $-\alpha_2$, $\alpha_1$, and $-\alpha_0$ is calculated. The second
step is the application of the filter T along the even-numbered
rows after the image is transformed according to
the procedure of step 1. The hierarchy of the coefficients
is different from that described in [1]. After step 1 of the
first iteration of the two-step process, the coefficients in
odd-numbered rows are placed at the lowest level of the
hierarchy. The coefficients in the even-numbered rows are
then transformed according to step 2, and those coefficients
corresponding to odd-numbered columns are placed
immediately above the lowest level. The next iteration
is the application of the same procedure to the subgrid
of points with even coordinates. Evenness and oddness
are then replaced by divisibility by 4 and congruence to
2 mod 4. The procedure can be repeated in the obvious
manner. In the terminology of [1], the coefficients at pixels
with coordinates divisible by $2^r$ form the projection of the
data from $\mathcal{L}_{r-1}$ to $\mathcal{L}_r$. Figure 2 is an image of peppers
that was processed by this algorithm. The compression
ratio is 27:1.

III. The Haar Transform

The Haar transform is defined by the matrix

$$H = \begin{pmatrix}
1 & 1 & 1 & 1 \\
-1 & 1 & -1 & 1 \\
1 & -1 & -1 & 1 \\
-1 & -1 & 1 & 1
\end{pmatrix}$$


which is a Hadamard matrix (see [3]). Just as is the case
with the Daubechies filter T, this is a one-dimensional
filter which may be applied in different ways to the two-dimensional
image. The configuration which appears to
best take advantage of the proximity of the pixels and
produce the most appealing results visually is shown in
Fig. 3. The image is subdivided into 2 × 2 blocks. Within
each block the pixels are reordered as (2k, 2m), (2k, 2m + 1),
(2k + 1, 2m), and (2k + 1, 2m + 1), and the matrix H
is applied to each block separately to yield the transformed
image which is of the same size. The coefficients
in the transformed image are separated into two groups
which were designated in [1] as “smooth” and “detail.” In
the terminology of [1], those coefficients corresponding to
the pixels with even coordinates form the smooth group
(i.e., the image of the projection from $\mathcal{L}_0$ to $\mathcal{L}_1$), and the
remaining are the details (i.e., the image of the projection 
from C 0 to £ 0 ). The filter H is then applied to the smooth 
group in the obvious manner, leading to a hierarchy of 
transformed data. Figures 4(a) and 4(b) are two images 
processed by the Haar transform. The compression ratios
are 12:1 and 27:1, respectively. Notice that the edges are 
clearly defined, even with a compression factor of 27. 

Notice that the Haar transform is integer valued, and 
therefore its application can be implemented in integer 
arithmetic. Since averaging over four pixel values gen- 
erates large numbers rapidly, it was convenient to divide 
the values by four, which from the implementation point 
of view is easily achievable. Naturally, one loses some res- 
olution in this manner; nevertheless, this approach seems 
to be convenient. 
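
One level of the block transform described in this section can be sketched as follows. This is an illustration under stated assumptions, not the author's implementation: the image is a list of rows of integers with even dimensions, the first index is the row, the first output of each block (the all-ones row of the matrix) is stored at the pixel with even coordinates, and the function names are hypothetical. The division by four mentioned above is deliberately left out of the forward pass so that the round trip stays exact; the inverse uses the Hadamard property that the transpose times the matrix equals four times the identity.

```python
# Rows of the 4 x 4 matrix defining the Haar transform in this section.
H = [[1, 1, 1, 1],
     [-1, 1, -1, 1],
     [1, -1, -1, 1],
     [-1, -1, 1, 1]]

# Pixel order inside a 2 x 2 block: (2k, 2m), (2k, 2m+1), (2k+1, 2m), (2k+1, 2m+1).
OFFSETS = [(0, 0), (0, 1), (1, 0), (1, 1)]


def haar_blocks(img, inverse=False):
    """Apply H (or its inverse, H^T / 4) to every 2 x 2 block, in integer arithmetic."""
    rows, cols = len(img), len(img[0])
    out = [[0] * cols for _ in range(rows)]
    for r in range(0, rows, 2):
        for c in range(0, cols, 2):
            v = [img[r + dr][c + dc] for dr, dc in OFFSETS]
            if inverse:
                # H^T H = 4 I, so H^T v is an exact multiple of 4 for transformed data.
                w = [sum(H[j][i] * v[j] for j in range(4)) // 4 for i in range(4)]
            else:
                w = [sum(H[i][j] * v[j] for j in range(4)) for i in range(4)]
            for (dr, dc), val in zip(OFFSETS, w):
                out[r + dr][c + dc] = val
    return out


def smooth_part(coeffs):
    """Coefficients at even coordinates: the 'smooth' group fed to the next level."""
    return [row[::2] for row in coeffs[::2]]


img = [[1, 2], [3, 4]]
coeffs = haar_blocks(img)
print(coeffs)                              # [[10, 2], [0, 4]]
print(haar_blocks(coeffs, inverse=True))   # [[1, 2], [3, 4]]
```

Applying `haar_blocks` again to `smooth_part(coeffs)` gives the next level of the hierarchy; in a lossy coder the quantization of the coefficients, not the transform itself, is where information is discarded.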

IV. Bit Allocation and Quantization 

As noted in [1], the best method of quantization is by 
truncation to the nearest integer, irrespective of the distri- 
bution of the coefficients. Bit allocation is done according 
to the desired compression ratio and the level of the hi- 
erarchy in the wavelet pyramid. Figure 5 exhibits RMSE 
versus compression ratio for what, after much experimen- 
tation, appears to be a good choice of bit assignments to 
the wavelet coefficients obtained via the application of the 
filter T as described in Section II. The actual bits assigned 
to each level are given in Table 1 . 

The actual bit allocation significantly affects the trade-off
between compression ratio and root-mean-square error. In the
application of the Haar transform, the actual effect of different
bit-allocation schemes on RMSE is shown in Fig. 6. Each 
circled point in the graph corresponds to a particular as- 
signment of bits to different levels of the hierarchy of the 


wavelet coefficients. It is clear from the graph that cer- 
tain assignments do not produce desirable effects. The 
desirable bit-allocation schemes correspond to the lower 
envelope of the curve. This lower envelope is reproduced 
in Fig. 7. Figure 8 gives a comparison between the Haar 
transform algorithm and the standard JPEG DCT-based 
algorithm on the basis of the root-mean-square error. Note 
that the performance of the two methods is comparable. 
This is remarkable, since the latter approach makes use of 
refined quantization and bit-allocation algorithms, while in 
the tests reported here, rather crude methods were used. 
It is therefore reasonable to expect that with further re- 
finement of the Haar method, it will have superior perfor- 
mance even in terms of RMSE. The actual bits assigned 
to each level of the hierarchy are given in Table 2. For 
practical applications, the tables and the graphs may be 
used as guidelines for bit allocation. 


V. Conclusion 

The Haar transform is a viable alternative to the dis- 
crete cosine transform for image compression. With proper 
bit-allocation, this approach has performance visually su- 
perior to that of the commonly used DCT-based JPEG 
algorithm at compression ratios exceeding 20:1. In terms 
of RMSE, the performances of the two methods are com- 
parable. It is therefore reasonable to expect that with 
further refinement of the quantization and bit-allocation 
algorithms, the performance of the Haar transform will 
become superior to that of the DCT-based algorithm even 
in terms of RMSE. 
Table 1. The assignment of bits resulting
in the graph shown in Fig. 5.

Compression ratio    Bit allocation
20.7                 2, 2, 4, 4, 6, 6
25                   2, 2, 3, 3, 6, 6
29.7                 2, 2, 3, 3, 4, 6
36.2                 0, 2, 3, 3, 4, 5
42.8                 0, 2, 3, 3, 4, 5, 6, 6
46.8                 0, 0, 3, 3, 4, 4, 6, 6


Table 2. The assignment of bits resulting
in the plots in Fig. 8.

Compression ratio    Bit allocation
9.1                  4, 6, 8
24.2                 2, 4, 6, 8
27.1                 0, 4, 6, 8
29.2                 0, 4, 6, 6
35.5                 2, 3, 5, 7
37.3                 2, 3, 5, 6
39.7                 2, 3, 5, 6, 6
42.0                 0, 3, 5, 7
43.3                 2, 3, 4, 6
43.6                 0, 2, 6, 6
48.1                 0, 3, 5, 6, 6
49.0                 2, 2, 4, 7
56.7                 2, 2, 3, 7
57.5                 0, 2, 4, 8
62.4                 0, 2, 4, 7
67.9                 0, 2, 4, 6
Fig. 1. Step 1 of reconfiguration.



Fig. 2. An image of peppers that was processed by a transform 
based on the Daubechies filter. 


Fig. 3. Configuration of pixels for the Haar
transform.




Fig. 4. Images processed by the Haar transform when (a) the
compression ratio is 12:1 and (b) the compression ratio is
27:1.



Fig. 5. Root-mean-square error versus compression ratio for 
good bit assignments. (See Table 1 for actual bits assigned.) 





Fig. 6. The effect on root-mean-square error of different
bit-allocation schemes resulting from application of the Haar
transform.




Fig. 8. Root-mean-square error comparison of the Haar trans- 
form and JPEG DCT-based algorithms. 
Failure Monitoring in Dynamic Systems: Model
Construction Without Fault Training Data
Advances in the use of autoregressive models, pattern recognition methods, and
hidden Markov models for on-line health monitoring of dynamic systems (such as
DSN antennas) have recently been reported. However, the algorithms described in
previous work have the significant drawback that data acquired under fault conditions
are assumed to be available in order to train the model used for monitoring
the system under observation. This article reports that this assumption can be
relaxed and that hidden Markov monitoring models can be constructed using only
data acquired under normal conditions and prior knowledge of the system characteristics
being measured. The method is described and evaluated on data from
the DSS 13 34-m beam waveguide antenna. The primary conclusion from the experimental
results is that the method is indeed practical and holds considerable
promise for application at the 70-m antenna sites where acquisition of fault data
under controlled conditions is not realistic.


I. Introduction and Background 

In previous articles, the problem of on-line health monitoring
of a dynamic system (in particular, a DSN
34-m beam waveguide [BWG] antenna) has been investigated
[1-3]. The problem can be stated in the following
simple manner: let the observed data be denoted by
$X(t) = \{x(t), x(t-\tau), \ldots, x(0)\}$, where each of the x(t) is a
k-dimensional vector measurement of sensor data sampled
at discrete time intervals $\tau$. Given X(t), the problem is
to determine the most likely current state of the system
at time t, where the system is assumed to be in one of
m states $\{\omega_1, \ldots, \omega_m\}$. The states are unobservable
directly, but can be inferred from the observable data X(t).
In probabilistic terms, the modelling goal is to accurately
model $p[\omega_i(t) \mid X(t)]$ (either from prior knowledge, training
data, or a combination of both), while for prediction,
$p[\omega_i(t) \mid X(t)]$ is used to predict the current state given a
specific set of data X(t) for which the system state is unknown.
Typically $\omega_1$ corresponds to the normal operating
state of the system, while the other states represent various
system faults that may occur. The quality of a particular
model for $p[\omega_i(t) \mid X(t)]$ can be obtained by measuring
an empirical estimate of the prediction accuracy, which is
simply the percentage of time that the state predicted by
the model agrees with the true state; the test is performed
over a period of time where the system cycles through various
states (not known to the model) using data that are
independent of those on which the model was trained.

In [1] and [2], an autoregressive-exogenous (ARX) time 
series model coupled to a pattern recognition component 
was used as the basis for estimating p[wi\x(t)]. This is a 
relatively simple model providing state estimates based 
only on instantaneous measurements x{t) but ignoring 
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past data. This model resulted in prediction accuracies of 
about 90 percent on independent test data sets obtained 
at DSS 13. A significant improvement on this method 
was reported in [3] whereby the past data were used in 
the state estimates by embedding the problem in a hid- 
den Markov model (HMM) framework. The key point of 
the HMM method is that prior knowledge regarding the 
temporal behavior of the states can be used to effectively 
model temporal correlations in the system at the state 
level. On-line tests of this method at DSS 13 in Novem- 
ber 1991 resulted in no prediction errors during a 1-hr test 
with state estimates being provided by the model every 
6 sec [3].

It should be pointed out that the autoregressive and
hidden Markov modelling methods are not the only approaches
to the fault detection problem. In [4] a number of
statistical change detection methods were investigated. It
was found that change detection methods require significant
prior knowledge of the behavior of parameter characteristics
when the system enters a fault state. This type of
detailed prior knowledge is unlikely to be available in practice,
which limits the applicability of these methods.

II. Limitations of Previously Reported 
Methods 

While the models described in [1-3] display useful ca- 
pabilities in terms of on-line fault detection, they suffer 
from two major limitations: 

(1) The models assume that the known states are exhaustive,
i.e., the set of states $\{\omega_1, \ldots, \omega_m\}$ covers
all possible states in which the system may be.

(2) The models also require that labelled training data
are available for each state, i.e., for each state $\omega_i$
there is a set of data $\{x(t), x(t - \tau), \ldots, x(0)\}$ which
was measured when the system was known to be in
state $\omega_i$.

Clearly, neither of these requirements can be satisfied in
most real-world fault detection applications. For fault detection,
the assumption that all system states due to faults
can be specified in advance is clearly inappropriate except 
for the simplest of systems — real-world systems (such as 
DSN antenna pointing systems) often contain large num- 
bers of interacting components with feedback and non- 
linearities, making prior prediction of all possible system 
behaviors under fault conditions unrealistic. However, it 
should be pointed out that it is usually possible to model 
system behavior under a small set of likely system faults — 
this point will be expanded upon later in this article. 


The second requirement, that training data are avail- 
able for each possible system state, is coupled to the first 
assumption: if all possible states cannot be described in 
advance, then the notion of having training data for such 
states is moot. However, even if the first assumption were 
satisfied and all fault states could be described in advance, 
the requirement that data can be recorded when the sys- 
tem is in each of these states is often unrealistic. A good 
example is a DSN 70-m antenna where hardware simula- 
tion of fault conditions is not a practical option due to 
operational considerations (as compared to the DSN 34-m 
antenna at DSS 13). 

Hence, there is considerable practical motivation to de- 
velop methods that relax the assumptions on which the 
earlier-reported models are based, while still retaining the 
accurate prediction capabilities of these models. This arti- 
cle describes a relatively simple yet effective method that 
can detect the presence of states for which no training data 
sets were available, i.e., states about which the model has 
no knowledge. It is assumed that training data (or else a 
strong prior model) for at least one state is available — this 
is not restrictive since data under normal conditions are 
almost always available. The proposed method is based on 
the use of prior knowledge to constrain the possible distri- 
bution of system parameters, which when coupled with the 
model derived from the training data, allows detection of 
both known states and a generic, unknown state category. 

This article outlines the general model, illustrates its 
use and effectiveness on data collected from the elevation 
axis of the DSS 13 BWG antenna pointing servomech- 
anism, and describes the limitations of the current ap- 
proach. 

III. Notation and Assumptions 

For the purposes of this article, the distinction is made 
between the observable data at time $t$, which is $x(t)$, and
the estimated parameters at time $t$, denoted by the vector
$\theta(t)$. Typically $x(t)$ is the original sensor data or time
series (such as the motor current in an antenna pointing
system), whereas the values of $\theta(t)$ are typically statistical
estimates of some characteristics of the time series such as
the mean, variance, or autoregressive (AR) coefficients. In
this article, attention will be limited to block estimation
methods whereby $\theta(t) = f[x(t), x(t - \tau), \ldots, x(t - N\tau)]$,
etc. Hence, each of the parameter estimates is derived from 
disjoint windows or blocks of the original data, where N 
is chosen to be large enough to enable reasonably reliable 
statistical estimates. 
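
As an illustration of block estimation, the following minimal sketch computes one parameter vector per disjoint block of samples. It assumes NumPy, an AR(2) fit by ordinary least squares, and the standard deviation as the third characteristic; the function names `estimate_block_parameters` and `block_estimates` are hypothetical, and the article does not state which estimator was actually used.

```python
import numpy as np

def estimate_block_parameters(block):
    """Return (phi1, phi2, sigma) for one block of samples.

    Assumption: the AR(2) coefficients are fitted by ordinary least squares
    on the mean-removed block; other estimators would serve equally well.
    """
    x = np.asarray(block, dtype=float)
    sigma = x.std()
    z = x - x.mean()
    # Regress z[t] on z[t-1] and z[t-2].
    design = np.column_stack([z[1:-1], z[:-2]])
    target = z[2:]
    (phi1, phi2), *_ = np.linalg.lstsq(design, target, rcond=None)
    return phi1, phi2, sigma

def block_estimates(x, block_size=200):
    """Split x into disjoint blocks and estimate one parameter vector per block."""
    n_blocks = len(x) // block_size
    return np.array([
        estimate_block_parameters(x[k * block_size:(k + 1) * block_size])
        for k in range(n_blocks)
    ])
```

Each row of the array returned by `block_estimates` plays the role of one parameter vector in the sequence of estimates; a larger block size gives less noisy estimates at the cost of slower updates.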

Let $\Phi_t = \{\theta(t), \theta(t - N\tau), \ldots, \theta(0)\}$. In effect, $\Phi_t$ is
then viewed as the observable data sequence and the problem
can be treated as that of recovering the likely system
states given the estimates $\Phi_t$, i.e., find $p[\omega_i(t) \mid \Phi_t]$. Issues
such as choosing appropriate estimators, block sizes, etc.,
will not be dealt with in this article. For the experimental
results reported later in this article, values of $\tau = 20$
msec and $N = 200$ are used. However, for the purposes of
simplification of notation, it will be assumed without loss
of generality that $N\tau = 1$ during the development of the
probabilistic models that follow. 

It is also assumed that there are $m - 1$ states for which
prior information is available either in the form of: (1) spe- 
cific parametric models for the dependence of the states 
on the observable data, or (2) training data. An addi- 
tional mth state is used in the model as a single state 
which accounts for all other possible behaviors of the sys- 
tem that are qualitatively different from the known states. 
This state will be referred to as the unknown fault state. 
Hence, in the simplest case, for example, if prior informa- 
tion is only available for the normal state, then the model 
has two states: normal and the unknown fault state. 


IV. The General Model 

The goal of the modelling process is to provide a means 
of estimating the posterior state probabilities 


$$p[\omega_i(t) \mid \Phi_t] = p[\omega_i(t) \mid \theta(t), \theta(t-1), \ldots, \theta(0)], \quad 1 \le i \le m \tag{1}$$


which are required for prediction. In the Appendix it is 
shown how the hidden Markov framework can be used such 
that the full number of conditioning terms in Eq. (1) is not 
necessary if the appropriate assumptions are met. The 
hidden Markov model leads to recursive estimates of the 
form 


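$$p[\omega_i(t) \mid \theta(t), \theta(t-1), \ldots, \theta(0)] \propto p[\theta(t) \mid \omega_i(t)] \times \sum_{j=1}^{m} f\bigl(p[\omega_j(t-1) \mid \theta(t-1), \ldots, \theta(0)]\bigr) \tag{2}$$

so that knowledge of the likelihood $p[\theta(t) \mid \omega_i(t)]$ at each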
time t (in addition to the Markov transition matrix A) is 
sufficient to calculate the posterior estimates. 

Note that it will be assumed that the statistics of inter- 
est are time-invariant, hence reference to a specific time t 
can be dropped at this point. 


In previous work, direct forward models of $p(\omega_i \mid \theta)$ were
estimated and then $p(\theta \mid \omega_i)$ was estimated by the use of
Bayes’ rule in Eq. (2) [3]. In this article, it is proposed
to use models of the form $p(\theta \mid \omega_i)$ as the direct basis for
the model. The rationale behind this approach is simple:
based on prior knowledge alone it is impossible except in
simple cases to specify the form of $p(\omega_i \mid \theta)$. However, it is
much more likely that one can model the dependence of
the data on the state, i.e., a prior density can be assigned
to the likelihood $p(\theta \mid \omega_i)$ based on prior knowledge. In
particular, for state $\omega_m$, which is the state that covers
all possible states not included in the set $\{\omega_1, \ldots, \omega_{m-1}\}$,
one can typically specify a noninformative uniform prior
density over the set of possible parameter values for $\theta$. In
addition, one must also supply models for $p(\theta \mid \omega_i)$, $1 \le i \le m - 1$,
which are typically estimated from training data.

The key difference between this method and those 
methods proposed in previous literature is that the model 
works with likelihoods (the probability of the observable 
data given the states), rather than directly with the pos- 
terior state probabilities (the probabilities of the states 
given the data). This approach rules out the use of many 
discriminant-based methods that only provide estimates of 
the posterior probabilities, but do not provide estimates 
of the likelihoods (for example, logistic regression, feed- 
forward neural networks, decision trees, etc.). Methods 
that provide the required estimates include (naturally) 
both parametric methods (such as maximum likelihood 
classifiers based on a specific parametric form for $p(\theta \mid \omega_i)$)
and nonparametric methods such as kernel density esti- 
mators. For a more extensive general discussion of the 
differences between such models, see [5-7]. 

The proposed method can be summarized as follows: 

(1) Specify or estimate prior density models, $p(\theta \mid \omega_i)$,
for the known classes, $\omega_1, \ldots, \omega_{m-1}$. As mentioned
above, this requires the use of either a parametric
model (such as a multivariate Gaussian) or a non- 
parametric density estimation method. 

(2) Specify a prior density for $p(\theta \mid \omega_m)$ where $\omega_m$ is the
special unknown state. This is typically done by establishing
bounds or constraints on each of the parameters
in $\theta$ and then (in the absence of any other
information) specifying a uniform density over the 
bounded parameter space. 

(3) The remainder of the method is the same as before: 
simply estimate the hidden Markov model param- 
eters from reliability data (as described in the Ap- 
pendix) and run the model for prediction. 
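
Steps (1) and (2) can be sketched as follows. This is a minimal illustration assuming NumPy, a multivariate Gaussian for a known state, and a simple box-shaped bounded region for the unknown state; the function names are hypothetical and the box is only a stand-in for whatever constraints apply to a given parameterization.

```python
import numpy as np

def fit_gaussian(theta_train):
    """Maximum likelihood mean and covariance from training vectors (rows)."""
    theta_train = np.asarray(theta_train, dtype=float)
    mean = theta_train.mean(axis=0)
    # bias=True divides by n, which is the maximum likelihood estimate.
    cov = np.cov(theta_train, rowvar=False, bias=True)
    return mean, cov

def gaussian_density(theta, mean, cov):
    """Likelihood of theta under a known state with Gaussian density."""
    mean = np.asarray(mean, dtype=float)
    diff = np.asarray(theta, dtype=float) - mean
    quad = diff @ np.linalg.solve(cov, diff)
    norm = np.sqrt((2.0 * np.pi) ** len(mean) * np.linalg.det(cov))
    return float(np.exp(-0.5 * quad) / norm)

def uniform_density(theta, lower, upper):
    """Likelihood of theta under the unknown state: uniform over a bounded box.

    Assumption: the parameter constraints form an axis-aligned box.
    """
    theta = np.asarray(theta, dtype=float)
    lower = np.asarray(lower, dtype=float)
    upper = np.asarray(upper, dtype=float)
    inside = np.all((theta >= lower) & (theta <= upper))
    return float(1.0 / np.prod(upper - lower)) if inside else 0.0
```

Close to the training data the Gaussian density is far larger than the flat density, so a known state is favored; far from all training data the Gaussian densities decay toward zero and the flat density of the unknown state dominates.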

Note that in step (2) it is important that the derived parameters
can be bounded in some manner. The stronger the
constraints that can be placed on the parameters, the better
will be the detection performance of the model. These
constraints could be due to the basic physics of the system, 
such as energy limits, or a function of the particular rep- 
resentation being used, such as spectral or autoregressive 
estimates (a specific example is provided in Section V). If 
there are no constraints at all, then it is still possible to 
specify a prior model, such as a Gaussian model, although 
the choice of model may now be somewhat more problem- 
atic since it will inevitably reflect a prior bias which may 
not be appropriate. A better alternative (in the case of no 
constraints) would be not to use a prior model at all for
$\omega_m$ and just detect data which appear to be outliers from
the other $m - 1$ models. However, outlier detection can
be problematic; it is a central theme of this article that
prior constraints can usually be placed on the parameter
space of interest and that this provides the natural avenue
for detecting data from $\omega_m$. In essence, it is argued that
if such prior constraints exist, this information should be 
used in the model, and should in principle provide better 
detection capabilities than any outlier detection method. 

V. Applying the Likelihood Method 
In Practice 

One significant difference between modelling the like- 
lihoods and posterior probabilities is the issue of dimen- 
sionality, namely, that a high-dimensional parameter space 
will be potentially more problematic for the likelihood 
modelling method than for the posterior (or discriminant) 
modelling method. In a discriminant model (which cal- 
culates posterior probabilities), input dimensions can be 
ignored in the model if they are irrelevant to the state, al- 
lowing more efficient estimation at small training sample 
sizes. However, in the likelihood model, all input dimen- 
sions must be included in the model. If there are a signif- 
icant number of irrelevant or redundant inputs, this can 
lead to a poor model, particularly as the ratio of sample 
size to input dimensions gets small. Hence, parsimony in 
parameter choice is recommended. 

From previous work with the DSS-13 BWG-antenna 
pointing system, it has been found that autoregressive co- 
efficients and standard deviation estimates are both par- 
ticularly useful characteristics of the motor current for the 
purpose of detecting abnormal events [2,3]. In this arti- 
cle, three such characteristics as estimated from the mo- 
tor current signal will be chosen: the two coefficients of 
a second-order autoregressive model [AR(2)], $\phi_1$ and $\phi_2$,
and the standard deviation, $\sigma$. Hence, $\theta = (\phi_1, \phi_2, \sigma)$. In
[2] and [3], an eighth-order ARX model was used to model 
the motor current signal, using the rate command as the 
forcing term. However, in the interests of keeping the in- 
put dimensionality relatively low, a simpler AR(2) model 


was used for the purposes of this experiment. While the 
simpler model is not appropriate for complete system iden- 
tification, it is sufficient for the purposes at hand to extract 
useful signal characteristics that can be used to discrimi- 
nate between normal and abnormal operating conditions. 

The next step is to specify a prior density over the AR 
parameters $\phi_1$ and $\phi_2$. In accordance with standard time
series theory, if the estimated process (as represented by
the two coefficients) is to be stationary, then the coefficients
must obey the following restrictions [8]:

$$\phi_1 + \phi_2 < 1$$

$$\phi_2 - \phi_1 < 1$$

$$-1 < \phi_2 < 1$$

It will be assumed that the estimated coefficients do in
fact describe a stationary process, thus providing bounds on the
possible parameter values (see Fig. 1). A uniform density is specified
over all such allowable values of $\phi_1$ and $\phi_2$. Of course,
this does not allow for the fact that in practice (and in
particular for fault conditions) there is no guarantee that
the estimated coefficients will obey these bounds. The following
approach is adopted: if the estimated coefficients
lie outside the bounds of the stationary region, then the
probability of the normal state $p(\omega_1 \mid \theta)$ is set to zero.

The third parameter, the standard deviation of the voltage
from the Hall effect sensor, which measures motor current,
is about 20 mV under normal conditions. Based on
experience from observing the motor current signal under 
a variety of conditions, it is estimated that under any fault 
condition the standard deviation should not exceed 1 V. 
Hence, in the absence of any other prior information, a 
uniform density is placed on the standard deviation over 
the range 0 to 1 V for $\sigma$. This density is assumed independent
of the AR(2) coefficient density. This completes
the specification of the likelihood model for $\omega_m$.
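
The resulting density for the unknown state can be written down directly. The sketch below assumes the standard AR(2) stationarity triangle (vertices at (-2, -1), (2, -1), and (0, 1) in the coefficient plane, hence area 4) and the 0 to 1 V range for the standard deviation; the function names are hypothetical. The separate rule of forcing the normal-state probability to zero outside the stationary region is not shown here.

```python
TRIANGLE_AREA = 4.0      # area of the AR(2) stationarity triangle
SIGMA_MAX_VOLTS = 1.0    # upper bound on the standard deviation, from the text

def is_stationary_ar2(phi1, phi2):
    """True if (phi1, phi2) lies inside the AR(2) stationarity triangle."""
    return (phi1 + phi2 < 1.0) and (phi2 - phi1 < 1.0) and (-1.0 < phi2 < 1.0)

def unknown_state_density(phi1, phi2, sigma):
    """Uniform density over (stationary triangle) x [0, 1 V].

    Returns 0.0 outside the bounded region; the two factors are treated
    as independent, as stated in the text.
    """
    if not is_stationary_ar2(phi1, phi2):
        return 0.0
    if not (0.0 <= sigma <= SIGMA_MAX_VOLTS):
        return 0.0
    return 1.0 / (TRIANGLE_AREA * SIGMA_MAX_VOLTS)
```

With these bounds the density is the constant 0.25 everywhere inside the region. Tightening either bound raises this constant, which makes the unknown state easier to trigger.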

For the other $m - 1$ states, normal and any known fault
conditions, likelihood models can be found via the use of 
Gaussian assumptions with maximum likelihood parame- 
ter estimation or nonparametric density estimation. 

VI. Experimental Results 

In [2] the acquisition of data at DSS 13 was described. 
Specifically, sensor data were measured under controlled 
conditions from the elevation axis servomechanism of the 
34-m BWG antenna. Data are available for two different
days of antenna operation, referred to as day 42 and day
53. Data were recorded for about 30 min with four differ- 
ent fault conditions present. The faults are: tachometer 
noise, tachometer failure, compensation loss in the ampli- 
fier, and encoder failure. The fourth fault, encoder failure, 
was a real fault that was subsequently repaired. It shows 
up in the data as being intermittent in nature. The other 
three faults were purposely introduced into the hardware 
in a controlled manner. 

The same model for the prior likelihood $p(\theta \mid \omega_m)$ as described
in Section V was used in all experiments. The
Markov transition matrix was set to have a probability of
0.99998 of remaining in the normal state, which corresponds
to a mean time between failures of about 2 days.
The probability of transiting to any particular fault state 
was set uniformly, and the probability of remaining in a 
fault state was set to 0.95 (corresponding to a mean failure 
duration of 1 min before shutdown occurred). 
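
The arithmetic behind these settings can be checked with a short sketch. It assumes one state estimate per block, i.e., a step of 200 samples at 20 msec, or 4 sec. The article does not say where the probability mass that leaves a fault state goes, so the sketch assumes, purely for illustration, that it returns to the normal state. The function names are hypothetical.

```python
import numpy as np

STEP_SECONDS = 200 * 0.020  # assumed time between state estimates (4 s)

def mean_dwell_seconds(stay_probability, step_seconds=STEP_SECONDS):
    """Mean time spent in a state with the given self-transition probability."""
    return step_seconds / (1.0 - stay_probability)

def build_transition_matrix(n_faults, p_stay_normal=0.99998, p_stay_fault=0.95):
    """Column-stochastic matrix with A[i, j] = p(state i at t | state j at t-1).

    State 0 is normal; states 1..n_faults are fault states.
    """
    m = n_faults + 1
    A = np.zeros((m, m))
    A[0, 0] = p_stay_normal
    # Leaving the normal state: uniform over the fault states.
    A[1:, 0] = (1.0 - p_stay_normal) / n_faults
    for j in range(1, m):
        A[j, j] = p_stay_fault
        # ASSUMPTION (not stated in the article): a fault state that ends
        # returns to the normal state.
        A[0, j] = 1.0 - p_stay_fault
    return A
```

Under the 4-sec assumption, `mean_dwell_seconds(0.99998)` is 200,000 sec (about 2.3 days) and `mean_dwell_seconds(0.95)` is 80 sec, in line with the rounded figures quoted above.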

A model was trained on normal data and on one of the 
known faults (the compensation loss fault), giving a three- 
state model (normal, known fault, and unknown fault). 
The normal and known fault likelihood models were con- 
structed using a multivariate Gaussian density where the 
mean and covariance parameters were estimated from the 
data using maximum likelihood estimators. 

Two models were constructed in this manner (one on 
each of the day 42 and day 53 data sets) and then tested on 
the independent data from the other day. The goal of the 
experiment was to see if the model could correctly identify 
data as being either normal, a known fault, or an unknown 
fault. The ability to classify data into the third unknown 
category was of particular relevance, since, as described 
earlier in this article, previously developed models did not 
have this capability, i.e., all data were classified into one 
of the known states. 

The state sequence in each test data set was as follows:
normal; unknown fault (tachometer failure); known fault
(compensation loss); normal; unknown fault
(tachometer noise); and finally an unknown intermittent
fault (encoder failure). Each state lasted roughly 5 min
in duration. Figure 2 shows how one of the AR(2) coef- 
ficients changed as a function of the underlying state (for 
day 53). Note how noisy the estimates are, due in part to 
the fact that an AR(2) model is too simple to capture the 
full dynamics of the data. 

Figures 3(a) and 3(b) show the state estimation results 
in terms of estimated state probabilities [as in Eq. (2)] for 
each of the three states in the model. The results clearly 
indicate that the likelihood model has the ability to infer 


the correct state of the system from the observable data. 
As in [3], the Markov model adds stability to the estimates, 
reducing false alarms while still allowing a rapid transition 
when the underlying state changes. 

The important aspect of this new model is its ability to 
identify data as being of the unknown category, namely, 
between minutes 5 and 10, minutes 20 and 25, and the 
intermittent fault that occurred between minutes 25 and 
30. The response of the model is not entirely perfect. For 
example, during the test of day 42 data [Fig. 3(a)], in the 
first 5 min of normal operations, there appear to be at 
least two short false alarms, i.e., where the probability of 
normal conditions drops significantly below 1 even though 
the system is supposed to be in the normal state. This 
can be attributed to one of two possible causes: either 
the model is not quite accurate, or, more interestingly, al- 
though the system is assumed to be normal it is in fact in 
some other transient state. Closer examination of the orig- 
inal sensor data revealed that the second explanation was 
more likely to be true: the model detected the possibility 
of an unknown transient state that had not been noticed 
when these data were originally recorded. While this is a 
relatively simple example, it nonetheless demonstrates the 
basic concept of a model which can detect subtle changes 
and abnormalities in the behavior of a dynamic system — 
changes that are not noticeable to the human observer. 

It is also worth noting that the present model assigns,
a priori, a relatively low probability to short-lived states. An
obvious extension to the model proposed here would be to 
further refine the unknown state into substates based on 
their temporal characteristics, i.e., intermittent or tran- 
sient, or permanent. 

VII. Discussion 

This article has described the basic principles behind 
the construction of dynamic system monitoring models 
that can classify system states into an “unknown” cate- 
gory. Although the basic idea is quite simple, it has some 
very useful properties. In addition, it is worth noting that 
all previous fault monitoring methods described in the lit- 
erature (of which the authors of this article are aware) im- 
plicitly assume that all system states of interest are known 
in advance. For large-scale complex systems, this is clearly 
an undesirable and unrealistic assumption. 

A possible criticism of the proposed method is the pos- 
sibly arbitrary nature by which the prior density for the 
unknown state is assigned. Certainly it must be admitted 
that this can never be a purely objective choice and requires
the careful judgment of the modeller. However, any
model is by nature the result of various implicit biases and
subjective judgments, and hence the standard argument
of the Bayesian school of statistical modelling can be ap- 
pealed to: if any reasonable prior information exists, then 
it is judicious to include it in the model. The astute reader 
will have noted that by simply changing the boundaries 
or constraints on the parameters of interest, the modeller 
can in effect control the detection to false alarm trade-off 
characteristic of the model [also known as the receiver op- 
erating characteristic (ROC) in signal detection theory]. 
The use of decision theoretic methods to minimize the rel- 
evant loss function (in the context of choosing the prior 
density) would seem the appropriate avenue by which to 
control this aspect of the model. 

The ability to detect new system states does not come 
without a cost. As alluded to earlier, the mapping describ- 
ing how the observed data depend on the system states 
(the likelihood) is generally more difficult to estimate than 
the mapping describing how the states depend on the ob- 
served data. Hence, for example, in the case where one has 
three known faults, a model of the type which is proposed 
here may not be as accurate in terms of discriminating 
among these faults as the types of discrimination mod- 
els that focus exclusively on these faults but which ignore 
the possibility of an unknown fault. One way to avoid 
this problem is to improve the quality of the likelihood 
modelling process. For example, a Gaussian assumption 
is often not appropriate: nonparametric density estima- 
tion methods, if used correctly, may provide more accurate 
models for the known states. 

Another possibility would be to use both likelihood 
models and discriminative models as part of one overall 
model. Letting $p(\omega_m)$ be the posterior probability that
the data are from an unknown state (as calculated by a
likelihood model of the type described in this article), and
letting the symbol $\omega_{\{1, \ldots, m-1\}}$ denote the event that the
true system state is one of the known states, one can estimate
the true posterior probability of individual known
states as

$$p_d(\omega_i \mid \Phi, \omega_{\{1, \ldots, m-1\}}) \times [1 - p(\omega_m)], \quad 1 \le i \le m - 1$$

where $p_d(\omega_i \mid \Phi, \omega_{\{1, \ldots, m-1\}})$ is the posterior probability estimate
of the known states as provided by a discriminative
model such as described in [2] and [3]. Note that
this method does not in any way help to improve the abil- 
ity of the overall model to detect unknown states since 
that estimate remains unchanged; however, in principle, it 
should improve the ability of the model to distinguish be- 
tween specific known states. The possibility of improving 
the model described in this article by using this particular 
technique has not been tested in an experimental manner 
at this point. 
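
The combination rule amounts to a single rescaling step, sketched below under the assumption that the discriminative model returns one probability per known state. The function name `combine_posteriors` is hypothetical, and, as noted above, this scheme was not tested experimentally in the article.

```python
import numpy as np

def combine_posteriors(p_unknown, p_discriminative):
    """Combine an unknown-state posterior with discriminative posteriors.

    p_unknown:        posterior probability of the unknown state (likelihood model)
    p_discriminative: posteriors over the known states, conditioned on the
                      state being one of the known states
    Returns a vector over all states, known states first, unknown state last.
    """
    p_d = np.asarray(p_discriminative, dtype=float)
    p_d = p_d / p_d.sum()              # guard against unnormalised input
    known = p_d * (1.0 - p_unknown)    # p_d(...) x [1 - p(unknown)]
    return np.append(known, p_unknown)
```

For example, an unknown-state posterior of 0.2 together with discriminative posteriors of 0.75 and 0.25 gives combined probabilities of 0.6, 0.2, and 0.2.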

A final comment is that the ability of the likelihood 
model to detect unknown states is necessarily limited by 
the information in the observable data. For example, al- 
though the simple AR models reported here have given 
very useful information in terms of discriminating between 
normal and various fault states, it is quite possible that a 
fault state may not be well modelled by a simple linear 
AR model, i.e., that the AR coefficients will not yield any
useful information. Hence, in general, the use of more ro- 
bust signal characteristics should improve the model per- 
formance. 


VIII. Summary 

A new method was proposed that allows the construc- 
tion of HMM monitoring algorithms without the require- 
ment that training data for each of a prescribed set of 
faults be made available. Naturally, if such data (or equiv- 
alent prior knowledge) are available, then these data can 
also be incorporated into the new model. The proposed 
method was validated on data from the DSS-13 BWG- 
antenna pointing system. In particular, the model was 
able to detect system states that could not have been de- 
tected using previously reported methods. While there is 
still room for improvement in terms of the performance of 
this class of models, the results are nonetheless quite accu- 
rate and of significant practical importance in the context 
of monitoring 70-m antenna data where fault training data 
are unlikely to be available.

Fig. 1. Admissible region for AR(2) parameters.

Fig. 2. Estimates of coefficient $\phi_2$ as a function of the system state (day 53 data).

Appendix

Hidden Markov Model Description 


Let $\Omega$ denote the discrete-valued state variable taking
values in the set $\{\omega_1, \ldots, \omega_m\}$. A first-order discrete-time
Markov model is characterized by the assumption
that

$$p[\Omega(t) \mid \Omega(t-1), \ldots, \Omega(0)] = p[\Omega(t) \mid \Omega(t-1)] \tag{A-1}$$

That is, the conditional probability of any current
state given knowledge of all previous states is the same
as the conditional probability of the current state given
knowledge of the system state at time $t - 1$. This is
equivalent to the well-known assumption of “memorylessness”
in that the evolution of the system depends only
on the present state and not on past states. A direct
consequence is the fact that the number of consecutive 
time steps that the system spends in any given state will 
be a discrete random variable with a geometric distribu- 
tion. 
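
As a short worked illustration of this point, let $d$ (a symbol introduced here only for the example) denote the number of consecutive time steps spent in a state whose probability of remaining in place from one step to the next is $a_{ii}$. Then:

```latex
P(d = k) = a_{ii}^{\,k-1}\,(1 - a_{ii}), \qquad k = 1, 2, \ldots

E[d] = \sum_{k=1}^{\infty} k\, a_{ii}^{\,k-1}\,(1 - a_{ii}) = \frac{1}{1 - a_{ii}}
```

For instance, a probability of 0.95 of remaining in a state gives an expected stay of 20 time steps.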

In a standard, nonhidden Markov model, to calculate
the probability that the system is in a given state at time $t$,
one needs only to know the initial state probabilities
$\pi = p[\omega_1(0), \ldots, \omega_m(0)]$ and the values $a_{ij} = p[\omega_i(t) \mid \omega_j(t-1)]$,
$1 \le i, j \le m$. The $m \times m$ matrix $A$ is known as the transition
matrix and characterizes the Markov model. The
first-order Markov assumption governing state evolution 
in time may appear restrictive at first glance, but has 
been found in practice to be an extremely robust model 
for many real-world applications. In principle, the theory 
for higher-order Markov models can be developed, but at 
the cost of increased complexity in terms of specifying the 
model and of increased computational complexity in terms 
of on-line calculation of the posterior probabilities. 

Under the first-order Markov assumption, it can easily
be shown that the probability that the system remains
in the normal state from one instant to the next can be
expressed as

$$a_{11} = 1 - \frac{\tau}{\mathrm{MTBF}} \tag{A-2}$$

where the MTBF is the mean time between failures of
the system and $\tau$ is the time between states (both expressed
in the same units). Similarly, the other elements
in the transition matrix $A$ can be estimated from information
concerning the general nature of system faults, which
may be available from an existing database or can be estimated
based on known physical properties of the system.
Augmented models may have a wide variety of additional
states. For example, it may be useful to include a state
to account for the transient behavior of the system. Similarly,
states which account for known operational modes
of the system, such as powered off and brakes on, may
be necessary in practice. The specification of the Markov
transition matrix corresponds to the explicit modelling of
high-level prior knowledge concerning system behavior at
the state level. In particular, it does not involve the specification
of prior models for observable data over time since
typically this is much more difficult to model. This is
precisely the advantage of the HMM decomposition: the
temporal behavior of the system needs only to be specified
at a relatively high level.

Denote the observed data up to time $t$ to be
$\Phi_t = \{\theta(t), \ldots, \theta(0)\}$. The hidden aspect of the Markov model
is derived from the fact that the observed data $\Phi_t$ is a
stochastic function of the underlying Markov states. These
states are hidden in the sense that they cannot be measured
directly. It is the state identities which one wishes to
estimate; hence, the purpose of the modelling is to represent
the relationship between the states and the observable
data such that the most likely state sequence can be inferred.
Figure A-1 shows an illustration of the concept for
a three-state HMM. For on-line monitoring of a dynamic
system, the observed data simply consist of observed sensor
data (or derived parameters) while the states reflect the
underlying system states, in particular normal and fault
operational states.

An estimate of the instantaneous likelihood, the probability
of the observed data at time $t$ conditioned on the
state variable, $p[\theta(t) \mid \Omega(t)]$, is assumed to be known. The
goal is to take advantage of all the symptom information
and to estimate $p[\Omega(t) \mid \Phi_t]$. It is convenient to work in
terms of an intermediate variable $\alpha$, where

$$\alpha_i(t) = p[\omega_i(t), \Phi_t] \tag{A-3}$$

To find the posterior probabilities of interest, it is sufficient
to be able to calculate the $\alpha$'s at any time $t$, since
by Bayes’ rule

$$p[\omega_i(t) \mid \Phi_t] = \frac{\alpha_i(t)}{p(\Phi_t)} = \frac{\alpha_i(t)}{\sum_{j=1}^{m} \alpha_j(t)} \tag{A-4}$$


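The derivation of a recursive estimate follows:

$$\alpha_i(t) = \sum_{j=1}^{m} p[\omega_i(t), \omega_j(t-1), \Phi_t] = \sum_{j=1}^{m} p[\theta(t), \omega_i(t), \omega_j(t-1), \Phi_{t-1}]$$

$$= \sum_{j=1}^{m} p[\theta(t), \omega_i(t) \mid \omega_j(t-1), \Phi_{t-1}]\, \alpha_j(t-1)$$

by the definition of $\alpha$

$$= \sum_{j=1}^{m} p[\theta(t) \mid \omega_i(t)]\, p[\omega_i(t) \mid \omega_j(t-1), \Phi_{t-1}]\, \alpha_j(t-1)$$

assuming that $\theta(t)$ is independent of past observations and past states, given the present state

$$= \sum_{j=1}^{m} p[\theta(t) \mid \omega_i(t)]\, p[\omega_i(t) \mid \omega_j(t-1)]\, \alpha_j(t-1)$$

assuming that $\omega_i(t)$ is independent of past observations given the past states

$$= p[\theta(t) \mid \omega_i(t)] \sum_{j=1}^{m} a_{ij}\, \alpha_j(t-1) \tag{A-5}$$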


The first term is the likelihood (assumed to be known). 
The terms in the sum are just a linear combination of the 
a’s from the previous time step. Hence, Eq. (A-5) pro- 
vides the basic recursive relationship for estimating state 
probabilities at any time t. 
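
The recursion can be sketched in a few lines. The version below assumes NumPy and a transition matrix stored so that `A[i, j]` is the probability of moving to state i from state j. It also renormalizes at every step, a common numerical convenience that is not part of the derivation above but yields the posterior probabilities directly. The function names are hypothetical.

```python
import numpy as np

def forward_step(alpha_prev, likelihoods, A):
    """One recursive update: likelihood of each state times sum_j a_ij * alpha_j(t-1).

    The result is renormalised to sum to one, so it is the posterior over states.
    """
    alpha = np.asarray(likelihoods, dtype=float) * (A @ np.asarray(alpha_prev, dtype=float))
    return alpha / alpha.sum()

def filter_states(likelihood_sequence, A, initial_probs):
    """Run the recursion over a sequence of per-state likelihood vectors."""
    posteriors = []
    alpha = None
    for lik in likelihood_sequence:
        lik = np.asarray(lik, dtype=float)
        if alpha is None:
            # First observation: likelihood times the initial state probabilities.
            alpha = lik * np.asarray(initial_probs, dtype=float)
            alpha = alpha / alpha.sum()
        else:
            alpha = forward_step(alpha, lik, A)
        posteriors.append(alpha)
    return np.array(posteriors)
```

Each row of the returned array is the vector of state probabilities given all data up to that time step. If every likelihood at some step is zero the normalization is undefined, so a practical implementation would need to guard against that case.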

The additional assumptions made in the derivation of 
Eq. (A-5) (besides the first-order Markov assumption on 
state dependence) require some comment. The first assumption
is that $\theta(t)$ is independent of both the most
recent state and the observed past data, given that the 
present state is known. This implies that the observed 
symptoms are assumed to be statistically independent 
from one time window to the next, given the state in- 
formation. This will generally be true when the values of 


$\theta(t)$ consist of derived parameters and $\tau$ is much greater
than any significant time constants of the dynamic system.
Even if it is known that the $\theta$'s exhibit temporal correlations,
this can also in principle be modelled directly in
Eq. (A-5), although the model will now be much more 
complex. The second assumption, that the present state 
only depends on the previous state but not the past ob- 
servations, simply reflects the causal relationship between 
symptoms and states. 

Note that the method described above only calculates 
the state probabilities based on past information. Alter- 
native estimation strategies are possible. For example, us- 
ing the well-known forward-backward recurrence relations 
[9], one can update the state probability estimates using 
symptom information which occurred later in time. 
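
A minimal sketch of such a smoothing pass is given below, assuming NumPy, the same transition matrix convention as in Eq. (A-5) (`A[i, j]` is the probability of state i given previous state j), and per-step rescaling for numerical stability. The function name `smooth_states` is hypothetical and the code illustrates the standard forward-backward recurrences rather than any implementation used in the article.

```python
import numpy as np

def smooth_states(likelihoods, A, initial_probs):
    """Posterior state probabilities at each time given the whole data record.

    likelihoods: array of shape (T, m); entry [t, i] is the likelihood of the
                 observation at time t under state i.
    """
    B = np.asarray(likelihoods, dtype=float)
    T, m = B.shape
    alpha = np.zeros((T, m))
    beta = np.ones((T, m))

    # Forward pass (past and present information).
    a = B[0] * np.asarray(initial_probs, dtype=float)
    alpha[0] = a / a.sum()
    for t in range(1, T):
        a = B[t] * (A @ alpha[t - 1])
        alpha[t] = a / a.sum()

    # Backward pass (information from later observations).
    for t in range(T - 2, -1, -1):
        b = A.T @ (B[t + 1] * beta[t + 1])
        beta[t] = b / b.sum()

    gamma = alpha * beta
    return gamma / gamma.sum(axis=1, keepdims=True)
```

The last row of the result coincides with the on-line estimate, since no later data exist at that point; earlier rows can differ because they also use observations made afterwards.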


A Functional Description of the Buffered
Telemetry Demodulator (BTD) 

H. Tsou, B. Shah, R. Lee, and S. Hinedi 
Communications Systems Research Section 


This article gives a functional description of the buffered telemetry demodulator 
(BTD), which operates on recorded digital samples to extract the symbols from the 
received signal. The key advantages of the BTD are (1) its ability to reprocess the 
signal to reduce acquisition time, (2) its ability to use future information about the 
signal and to perform smoothing on past samples, and (3) its minimum transmission 
bandwidth requirement as each subcarrier harmonic is processed individually. The 
first application of the BTD would be the Galileo S-band contingency mission, 
where the signal is so weak that reprocessing to reduce the acquisition time is 
crucial. Moreover, in the event of employing antenna arraying with full spectrum
combining, only the subcarrier harmonics need to be transmitted between sites,
resulting in a significant reduction in data rate transmission requirements. Software
implementation of the BTD is described for various general-purpose computers.


I. Introduction 

Current DSN receivers operate on the received signal
in real time and do not buffer or store the received signal
for reprocessing. As a result, data lost during acquisition
or cycle slips are not recoverable. In some DSN missions,
the link margin is so low that signal acquisition (carrier,
subcarrier, and symbol) takes too long, resulting in
significant data loss. Such a scenario occurs during
operation at low data rates (on the order of a few hundred
symbols per second) with rate 1/4, 1/5, and 1/6
convolutional codes, resulting in symbol signal-to-noise ratios
(SNR’s) between -4 dB and -6 dB. This combination of
low symbol SNR and low data rate produces unusually
long acquisition times dominated by the behavior of the
subcarrier- and symbol-synchronization loops. The
acquisition time can, in some instances, be as long as 20 minutes,
depending on the scenario of interest. The problem is
compounded when suppressed carriers are employed, since
the Costas loop SNR is a nonlinear function of the symbol
SNR; significant squaring loss results. When the data rate
on the spacecraft is programmed to take advantage of the
antenna aperture on the ground, several data rate changes
can occur in a pass or a single day, each resulting in data
loss due to resynchronization in the ground receiver.

The buffered telemetry demodulator (BTD) overcomes 
these problems by recording the signal and then processing 
and reprocessing the samples. Once acquisition occurs, the 
BTD can operate on the past data, using estimates of the 
signal state (such as phase, frequency, and Doppler rate) 
to go back in time and recover the symbols
lost during acquisition. This is referred to as smoothing,
where “future” data are used to provide an estimate in
the “past.” The optimum strategy for the data playback
is still under investigation and will be the topic of a future 
report. The BTD is designed mainly for low-data-rate ap- 
plications and can be implemented on a general-purpose 
workstation. In order to reduce the computational require- 
ment, the signal is recorded on a subcarrier harmonic ba- 
sis, thus rendering the BTD independent of the subcar- 
rier frequency. This results in a computational through- 
put requirement based on the data rate rather than on the 
bandwidth of the transmitted signal. In addition, when 
antenna arraying is employed and full spectrum combin- 
ing is utilized, only a fraction of the total signal band- 
width needs to be transmitted between sites, resulting in 
significant reduction in transmission requirements. In this 
article, the BTD is functionally described and its imple- 
mentation on various workstations is evaluated. First, the 
carrier and subcarrier synchronization loops are reviewed. 
Then, the recording of the signal is described; each sub- 
carrier harmonic is individually demodulated to baseband 
and recorded. Finally, the coherent baseband demodu- 
lation is discussed along with requirements for software 
implementation of the BTD. Also, a glossary is included. 


II. A Review of the Advanced Receiver
Carrier and Subcarrier Loops

The advanced receiver II (ARX II) [1] is a receiving
system that has been developed to digitally demodulate
and process signals from deep-space spacecraft. It is a
breadboard system for the Block V receiver, which will
eventually replace various receivers currently used in the
DSN [2]. This section briefly reviews the ARX II carrier
and subcarrier loops so that they can be compared to the
carrier and subcarrier loops of the BTD. The main
difference between the ARX II and BTD is that the ARX II
carrier and subcarrier loops process the received signal
after it has been open-loop downconverted to an
intermediate frequency (IF), whereas the BTD loops process the
received signal after it has been open-loop downconverted
to baseband on a subcarrier harmonic basis. All the other
receiving functions, e.g., symbol synchronization, symbol
detection, lock detection, etc., are implemented the same
way in both the ARX II and BTD. Consequently, they are
not discussed in this article. Instead, the interested reader
is referred to [1] for a complete review of these functions.

A block diagram of the ARX II carrier and subcarrier
loops is shown in Fig. 1. The received signal, which is
assumed to be a typical deep space signal, consists of a
residual sinusoidal carrier and a square-wave subcarrier.
After downconversion to an appropriate IF, it can be
represented as

$$r(t) = \sqrt{2P}\cos(\Delta)\sin[\Psi_c(t)] + \sqrt{2P}\sin(\Delta)\,d(t)\,\mathrm{sgn}\{\sin[\Psi_{su}(t)]\}\cos[\Psi_c(t)] + n(t) \qquad (1)$$

where $P_c = P\cos^2(\Delta)$ and $P_d = P\sin^2(\Delta)$ in watts are the
received carrier power and data power, respectively; $\Psi_c(t)$
is the total carrier phase; and $\Psi_{su}(t)$ is the total subcarrier
phase, both in radians. The data, $d(t)$, are given by

$$d(t) = \sum_{l=-\infty}^{\infty} a_l\,p(t - lT) \qquad (2)$$

where $p(t)$ is the unit-power square pulse limited to $T$
seconds and $\{a_l\}$ represents the independent and equally
likely binary (±1) symbols. The noise process, $n(t)$, is
modeled as bandpass with a flat, one-sided power spectral
density (PSD) level equal to $N_0$ W/Hz. By expressing the
square-wave subcarrier as an infinite sum, i.e.,

$$\mathrm{sgn}\{\sin[\Psi_{su}(t)]\} = \frac{4}{\pi}\sum_{\substack{j=1\\ j\ \mathrm{odd}}}^{\infty}\frac{1}{j}\sin[j\Psi_{su}(t)] \qquad (3)$$

the digitized signal, which is sampled at rate $R_s = 1/T_s$,
can be written as


$$r(kT_s) = \sqrt{2P_c}\,\sin[\Psi_c(kT_s)] + \sqrt{2P_d}\,d(kT_s)\left(\frac{4}{\pi}\sum_{\substack{j=1\\ j\ \mathrm{odd}}}^{L}\frac{1}{j}\sin[j\Psi_{su}(kT_s)]\right)\cos[\Psi_c(kT_s)] + n(kT_s) \qquad (4)$$

where only up to the $L$th subcarrier harmonics ($L$ is an
odd integer) are assumed to be present at the
analog-to-digital (A/D) converter output in Fig. 1. Samples of the
bandpass noise can be expressed as

$$n(kT_s) = \sqrt{2}\,n_1(kT_s)\cos[\Psi_c(kT_s)] - \sqrt{2}\,n_2(kT_s)\sin[\Psi_c(kT_s)] \qquad (5)$$

where the random variables $n_1(kT_s)$ and $n_2(kT_s)$ are
samples of the baseband noise processes $n_1(t)$ and $n_2(t)$,
which are assumed to be statistically independent, white
Gaussian noise processes with a flat, one-sided PSD level
equal to $N_0$ W/Hz within a one-sided bandwidth equal to
$1/(2T_s)$ Hz. As a result, $n_1(kT_s)$ and $n_2(kT_s)$ are
zero-mean with variance $\sigma_{n_1}^2 = \sigma_{n_2}^2 = N_0/(2T_s)$. The time
index $kT_s$ will now be omitted to allow a simpler notation.

The residual carrier tracking loop control signal, $U_q$ in
Fig. 1, is derived by mixing the digitized IF signal with its
carrier quadrature component, lowpass filtering the mixer
output, and then accumulating the filtered samples over
the loop update interval. Mathematically,

$$U_q = \sqrt{P_c}\,\sin(\phi_c) + n_{U_q} \qquad (6)$$

where $\phi_c$ is the carrier phase error, $n_{U_q}$ is a zero-mean
Gaussian random variable with variance $N_0/(2T_u)$, and
$T_u = M_1 T_s$ is the loop update interval in seconds. Note
that the data have been neglected in writing Eq. (6),
since it is assumed that the accumulator in the carrier
quadrature arm averages over several subcarrier cycles
(i.e., $f_{su} T_u \gg 1$) and that the data sum is thus
approximately zero. It will be shown in Section IV that the BTD
carrier loop control signal has the same signal power and
noise statistics as Eq. (6) and, therefore, both loops will
have the same tracking performance.

Next, consider the subcarrier tracking loop. As shown
in Fig. 1, the input to the subcarrier loop is the quadrature
component of the residual carrier. The subcarrier control
signal is obtained by forming the product $Z_i Z_{q,su}$, where
$Z_i$ is the output of the subcarrier inphase arm and $Z_{q,su}$
is the output of the subcarrier quadrature arm. Note that
the subscript “su” in $Z_{q,su}$ is used to distinguish it from
$Z_{q,sca}$, the quadrature arm of the suppressed carrier loop.
The signals $Z_i$ and $Z_{q,su}$ are obtained after mixing the
residual carrier quadrature component with, respectively,
the subcarrier inphase and quadrature references and then
accumulating these samples over a symbol duration
(perfect symbol synchronization assumed). These signals are
given as [1]

$$Z_i = \sqrt{P_d}\,a_l\,F_I(\phi_{su})\cos(\phi_c) + n_{Z_i} \qquad (7)$$

and

$$Z_{q,su} = \sqrt{P_d}\,a_l\,F_Q(\phi_{su})\cos(\phi_c) + n_{Z_{q,su}} \qquad (8)$$

where $\phi_{su}$ is the subcarrier phase error and the functions
$F_I(\phi_{su})$ and $F_Q(\phi_{su})$ are periodic functions of $\phi_{su}$ (period
$2\pi$) given in the principal phase interval $[-\pi, \pi]$ as follows
[1]:

$$F_I(\phi_{su}) = 1 - \frac{2}{\pi}|\phi_{su}| \qquad (9)$$

and

$$F_Q(\phi_{su}) = \begin{cases} (2/\pi)\,\phi_{su}, & |\phi_{su}| \le (\pi/2)W_{su} \\ \mathrm{sgn}(\phi_{su})\,W_{su}, & (\pi/2)W_{su} \le |\phi_{su}| \le \pi(1 - W_{su}/2) \\ 2\,\mathrm{sgn}(\phi_{su}) - (2/\pi)\,\phi_{su}, & \pi(1 - W_{su}/2) \le |\phi_{su}| \le \pi \end{cases} \qquad (10)$$

The parameter $W_{su}$, in fractions of cycles, is the width
of the subcarrier window ($W_{su} \le 1$, and $W_{su} = 1$
corresponds to the “no window” case). It is used in the
quadrature arm to improve the square-wave tracking
performance [3]. The noises $n_{Z_i}$ and $n_{Z_{q,su}}$ in Eqs. (7) and
(8) are independent with respective variances $N_0/(2T)$ and
$N_0 W_{su}/(2T)$, where $T$ denotes symbol duration. It will be
shown in Section IV that the BTD inphase and quadrature
accumulator outputs have the same signal and noise
statistics as Eqs. (7) and (8). Consequently, both
subcarrier loops will have the same control signal and, therefore,
the same tracking performance.

Finally, consider the case of the suppressed carrier with
subcarrier ($\Delta = 90$ deg in Eq. [1]). In this scenario, the
suppressed carrier loop control signal is obtained by
forming the product $Z_i Z_{q,sca}$, as shown in Fig. 1. The signal
$Z_i$, which was shown to be the subcarrier inphase
component, now becomes the suppressed carrier inphase
component because it is already proportional to $\cos(\phi_c)$ and
$F_I(\phi_{su})$, as indicated by Eq. (7). The suppressed carrier
quadrature component $Z_{q,sca}$ is generated by mixing the
digitized IF signal with the quadrature carrier reference
and the inphase subcarrier reference, and then
accumulating the resulting samples over a symbol duration. Hence,
the suppressed carrier inphase and quadrature signals are
given as [1]

$$Z_i = \sqrt{P_d}\,a_l\,F_I(\phi_{su})\cos(\phi_c) + n_{Z_i} \qquad (11)$$

and

$$Z_{q,sca} = \sqrt{P_d}\,a_l\,F_I(\phi_{su})\sin(\phi_c) + n_{Z_{q,sca}} \qquad (12)$$

where $n_{Z_i}$ and $n_{Z_{q,sca}}$ are independent and have variance
$N_0/(2T)$. It will be shown in Section IV that the inphase
and quadrature signals of the BTD suppressed carrier loop
have the same statistics as are found in Eqs. (11) and (12).
Hence, both suppressed carrier loops will have the same
error signal and tracking performance.
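
As a short worked step (a sketch that keeps only the signal-times-signal part of the product and drops every term involving noise), multiplying the signal parts of Eqs. (11) and (12) and using $a_l^2 = 1$ for binary (±1) symbols gives the familiar Costas-loop form of the suppressed carrier control signal:

```latex
\begin{aligned}
Z_i\,Z_{q,sca}\Big|_{\text{signal}}
  &= P_d\,a_l^2\,F_I^2(\phi_{su})\,\sin(\phi_c)\cos(\phi_c) \\
  &= \frac{P_d}{2}\,F_I^2(\phi_{su})\,\sin(2\phi_c)
\end{aligned}
```

The data polarity drops out of the product, and the loop responds to $2\phi_c$, scaled by the square of the subcarrier factor $F_I(\phi_{su})$.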


III. Baseband Recording 

As indicated earlier, the BTD processes the received 
signal after the signal has been open-loop downconverted 
to baseband on a subcarrier harmonic basis and recorded 
on tape. This section describes two methods for record- 
ing the received signal at baseband. The first recording 
scheme, depicted in Fig. 2, distributes the sampled IF 
signal into two carrier channels and 4n subcarrier
channels, where n denotes the number of subcarrier harmonics
present at the A/D output and is related to L (the
highest recorded subcarrier harmonic) through L = 2n - 1.
Recording of the carrier requires two channels because
baseband demodulation requires both inphase and
quadrature samples. Similarly, recording of each subcarrier
harmonic requires four channels: two channels (one inphase
subcarrier channel and one quadrature subcarrier channel)
per carrier channel (inphase and quadrature). Table 1
lists the loss in data power as a function of the highest
recorded subcarrier harmonic L.
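
The effect of truncating the subcarrier at harmonic L can be estimated from the Fourier series of a unit square wave, whose jth odd harmonic carries a fraction 8/(πj)² of the total power. The sketch below assumes the data power loss is simply the power outside the retained harmonics; the function names are illustrative, and the definition used for Table 1 may differ.

```python
import math


def data_power_fraction(highest_harmonic):
    """Fraction of square-wave power contained in odd harmonics 1..L."""
    if highest_harmonic < 1 or highest_harmonic % 2 == 0:
        raise ValueError("highest_harmonic must be a positive odd integer")
    return (8.0 / math.pi ** 2) * sum(
        1.0 / j ** 2 for j in range(1, highest_harmonic + 1, 2)
    )


def data_power_loss_db(highest_harmonic):
    """Loss in dB relative to recording every harmonic (assumed definition)."""
    return -10.0 * math.log10(data_power_fraction(highest_harmonic))


if __name__ == "__main__":
    for n in range(1, 6):
        L = 2 * n - 1  # highest recorded harmonic for n recorded harmonics
        print(f"n={n}  L={L}  loss={data_power_loss_db(L):.3f} dB")
```

Because the harmonic power falls off as 1/j², most of the benefit comes from the first few harmonics, which is what makes recording on a harmonic-by-harmonic basis economical.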

Each band of the sampled IF signal is recorded at
baseband after mixing it with tones tuned to its predicted
center frequency. The predicts are assumed to be close, but
not exactly equal, to the actual band center frequency. For
example, consider the two channels that record the
residual carrier. In the residual carrier sine channel, the
sampled IF signal is mixed with $\sqrt{2}\sin\hat{\Psi}_c(kT_s)$, where $\hat{\Psi}_c(kT_s)$
is an estimate of the actual carrier phase $\Psi_c(kT_s)$, and is
lowpass filtered to obtain the recorded waveform $C_s(kT_s)$.
Similarly, in the residual carrier cosine channel, the
sampled IF signal is mixed with $\sqrt{2}\cos\hat{\Psi}_c(kT_s)$ and lowpass
filtered to obtain $C_c(kT_s)$. Given the information in
Appendix A, and letting $\theta_c(kT_s) = \Psi_c(kT_s) - \hat{\Psi}_c(kT_s)$, the recorded
carrier is given by


$$C_s = \sqrt{P_c}\,\cos(\theta_c) + n_{C_s} \qquad (13)$$

$$C_c = \sqrt{P_c}\,\sin(\theta_c) + n_{C_c} \qquad (14)$$

where, using the Heaviside operator notation,¹

$$n_{C_s} = H_c(z)\{-n_1\sin(\theta_c) - n_2\cos(\theta_c)\} \qquad (15)$$

$$n_{C_c} = H_c(z)\{n_1\cos(\theta_c) - n_2\sin(\theta_c)\} \qquad (16)$$

The lowpass filter $H_c(z)$ is assumed to be bandlimited
to $B_c$ Hz. Furthermore, since the carrier frequency
error (derivative of $\theta_c$) is assumed to be much smaller than
$B_c$, both $n_{C_c}$ and $n_{C_s}$ are baseband processes limited to $B_c$
and have a flat, one-sided PSD level equal to $N_0$ W/Hz.
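
The signal parts of Eqs. (13) and (14) can be checked numerically with a noise-free carrier. The sketch below is an illustration under stated assumptions, not the recorder design: the lowpass filter is replaced by a plain average over an integer number of IF cycles, the phase offset is held constant, and the function name and its parameters are hypothetical.

```python
import math


def record_carrier(p_c, theta_c, cycles=50, samples_per_cycle=16):
    """Mix a noise-free carrier with the predicted-phase tones and average.

    Returns (C_s, C_c), the sine- and cosine-channel outputs.
    """
    n = cycles * samples_per_cycle
    acc_s = 0.0
    acc_c = 0.0
    for k in range(n):
        psi_hat = 2.0 * math.pi * k / samples_per_cycle  # predicted carrier phase
        psi = psi_hat + theta_c                          # actual carrier phase
        r = math.sqrt(2.0 * p_c) * math.sin(psi)         # carrier part of Eq. (4)
        acc_s += r * math.sqrt(2.0) * math.sin(psi_hat)  # sine channel mixer
        acc_c += r * math.sqrt(2.0) * math.cos(psi_hat)  # cosine channel mixer
    # Averaging over whole cycles removes the double-frequency terms.
    return acc_s / n, acc_c / n


c_s, c_c = record_carrier(p_c=4.0, theta_c=0.3)
```

With these inputs the two outputs come out as sqrt(P_c) cos(theta_c) and sqrt(P_c) sin(theta_c), so the pair carries the carrier phase offset at baseband.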

In the first approach, the recording of the subcarrier
harmonics at baseband requires a two-stage
downconversion of the IF signal. The first stage centers the IF
spectrum near DC by mixing it with sine and cosine references
tuned to the predicted carrier frequency. The second stage
mixes each of the subcarrier harmonics to baseband
individually by mixing the output of the first stage with sine
and cosine references tuned to multiples of the predicted
subcarrier frequency. Hence, to record the $m$th
subcarrier harmonic, the IF signal is first mixed with $\sin(\hat{\Psi}_c)$
and $\cos(\hat{\Psi}_c)$, and then with $\sin(m\hat{\Psi}_{su})$ and $\cos(m\hat{\Psi}_{su})$.
It is important to note that a phase relationship has to
be maintained between the various tones that demodulate
the subcarrier harmonics to baseband. Given the
information in Appendix A, the four signals corresponding to the
recording of the $m$th subcarrier harmonic are represented
as (where $m$ is an odd integer)

$$SU_{ss}^{(m)} = -\sqrt{P_d}\,d\,\frac{2}{\pi m}\cos[m\theta_{su}]\sin(\theta_c) + n_{SU_{ss}}^{(m)} \qquad (17)$$

$$SU_{sc}^{(m)} = -\sqrt{P_d}\,d\,\frac{2}{\pi m}\sin[m\theta_{su}]\sin(\theta_c) + n_{SU_{sc}}^{(m)} \qquad (18)$$

$$SU_{cs}^{(m)} = +\sqrt{P_d}\,d\,\frac{2}{\pi m}\cos[m\theta_{su}]\cos(\theta_c) + n_{SU_{cs}}^{(m)} \qquad (19)$$

$$SU_{cc}^{(m)} = +\sqrt{P_d}\,d\,\frac{2}{\pi m}\sin[m\theta_{su}]\cos(\theta_c) + n_{SU_{cc}}^{(m)} \qquad (20)$$

where the noise terms are given by

$$n_{SU_{ss}}^{(m)} = -H_{su}(z)\{[n_1\sin(\theta_c) + n_2\cos(\theta_c)]\sin[m\hat{\Psi}_{su}]\} \qquad (21)$$

$$n_{SU_{sc}}^{(m)} = -H_{su}(z)\{[n_1\sin(\theta_c) + n_2\cos(\theta_c)]\cos[m\hat{\Psi}_{su}]\} \qquad (22)$$

$$n_{SU_{cs}}^{(m)} = H_{su}(z)\{[n_1\cos(\theta_c) - n_2\sin(\theta_c)]\sin[m\hat{\Psi}_{su}]\} \qquad (23)$$

$$n_{SU_{cc}}^{(m)} = H_{su}(z)\{[n_1\cos(\theta_c) - n_2\sin(\theta_c)]\cos[m\hat{\Psi}_{su}]\} \qquad (24)$$

It can be verified that $n_{SU_{ss}}^{(m)}$, $n_{SU_{sc}}^{(m)}$, $n_{SU_{cs}}^{(m)}$, and $n_{SU_{cc}}^{(m)}$ are
baseband processes limited to $B_{su}$, the bandwidth of the
lowpass filter $H_{su}(z)$, and have a flat, one-sided PSD level
equal to $N_0/2$ W/Hz. Moreover, it can be shown that
these recorded noises are not only mutually independent
for every subcarrier harmonic (say, $m$), but are
also independent of the recorded noises at other harmonics
(any $n \ne m$).

¹ The Heaviside operation $H(z)\{x(m)\} = \sum_{n=-\infty}^{m} x(n)\,h(m-n)$
denotes the convolution of $x(m)$ with $h(m)$.

Another approach to baseband recording of the received
signal is shown in Fig. 3. Unlike the previous recording
scheme, which uses the two-stage downconversion
technique to record the subcarrier, this scheme records each
subcarrier harmonic by directly downconverting it to
baseband. As shown in Fig. 3, the sampled IF signal is still
distributed into (4n + 2) channels. The two channels used
for recording the residual carrier remain the same for this
new scheme. Consequently, the recorded carrier is as given
by Eqs. (13) and (14). However, the subcarrier
harmonics are recorded at baseband by directly mixing the upper
and lower sidebands of each harmonic to baseband. That
is, the $m$th subcarrier harmonic is recorded after mixing
the A/D output with sine and cosine references tuned to
$f_c + m f_{su}$ (upper sideband) and $f_c - m f_{su}$ (lower sideband),
where $f_c$ and $f_{su}$ denote, respectively, the predicted carrier
and subcarrier frequencies. Following the same procedure
used in the derivation of Eqs. (17)-(20) (see Appendix A),
the recorded signals for this scheme can be shown to be
(where $m$ is an odd integer)

$$SU_{s+}^{(m)} = \sqrt{P_d}\,d\,\frac{2}{\pi m}\cos[\theta_c + m\theta_{su}] + n_{SU_{s+}}^{(m)} \qquad (25)$$

$$SU_{s-}^{(m)} = -\sqrt{P_d}\,d\,\frac{2}{\pi m}\cos[\theta_c - m\theta_{su}] + n_{SU_{s-}}^{(m)} \qquad (26)$$

$$SU_{c+}^{(m)} = \sqrt{P_d}\,d\,\frac{2}{\pi m}\sin[\theta_c + m\theta_{su}] + n_{SU_{c+}}^{(m)} \qquad (27)$$

$$SU_{c-}^{(m)} = -\sqrt{P_d}\,d\,\frac{2}{\pi m}\sin[\theta_c - m\theta_{su}] + n_{SU_{c-}}^{(m)} \qquad (28)$$


where the subscripts “+” and “-”, respectively, denote
the upper and lower sidebands. The noises $n_{SU_{s+}}^{(m)}$, $n_{SU_{s-}}^{(m)}$,
$n_{SU_{c+}}^{(m)}$, and $n_{SU_{c-}}^{(m)}$ can be shown to have a flat, one-sided
PSD level equal to $N_0$ within $B_{su}$ Hz, the one-sided
bandwidth of the lowpass filter $H_{su}(z)$ in Fig. 3. These
upper and lower sideband noises are correlated with respect
to each other because the recorded signals of the direct
recording scheme are correlated. Specifically, the recorded
signals are related to those of Eqs. (17)-(20) as follows:

$$SU_{s+}^{(m)} = SU_{sc}^{(m)} + SU_{cs}^{(m)} \qquad (29)$$

$$SU_{s-}^{(m)} = SU_{sc}^{(m)} - SU_{cs}^{(m)} \qquad (30)$$

$$SU_{c+}^{(m)} = SU_{cc}^{(m)} - SU_{ss}^{(m)} \qquad (31)$$

$$SU_{c-}^{(m)} = SU_{ss}^{(m)} + SU_{cc}^{(m)} \qquad (32)$$


and consequently,

$$n_{SU_{s+}}^{(m)} = n_{SU_{sc}}^{(m)} + n_{SU_{cs}}^{(m)} \qquad (33)$$

$$n_{SU_{s-}}^{(m)} = n_{SU_{sc}}^{(m)} - n_{SU_{cs}}^{(m)} \qquad (34)$$

$$n_{SU_{c+}}^{(m)} = n_{SU_{cc}}^{(m)} - n_{SU_{ss}}^{(m)} \qquad (35)$$

$$n_{SU_{c-}}^{(m)} = n_{SU_{ss}}^{(m)} + n_{SU_{cc}}^{(m)} \qquad (36)$$

Clearly, $n_{SU_{s+}}^{(m)}$ is correlated with $n_{SU_{s-}}^{(m)}$, and $n_{SU_{c+}}^{(m)}$ is
correlated with $n_{SU_{c-}}^{(m)}$.
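
The relationship between the two recording schemes can be checked numerically on the noise-free signal parts. The sketch below is an illustration only: `amp` stands for the common amplitude factor sqrt(P_d) d 2/(pi m), the dictionary keys are hypothetical labels for the eight recorded signals, and the factor of 1/2 in the inversion is an added assumption so that amplitudes match exactly (a constant gain that a receiver could absorb elsewhere).

```python
import math


def two_stage_signals(amp, theta_c, theta_su, m):
    """Noise-free signal parts of the two-stage recordings, Eqs. (17)-(20)."""
    return {
        "ss": -amp * math.cos(m * theta_su) * math.sin(theta_c),
        "sc": -amp * math.sin(m * theta_su) * math.sin(theta_c),
        "cs": amp * math.cos(m * theta_su) * math.cos(theta_c),
        "cc": amp * math.sin(m * theta_su) * math.cos(theta_c),
    }


def direct_signals(amp, theta_c, theta_su, m):
    """Noise-free signal parts of the direct recordings, Eqs. (25)-(28)."""
    return {
        "s+": amp * math.cos(theta_c + m * theta_su),
        "s-": -amp * math.cos(theta_c - m * theta_su),
        "c+": amp * math.sin(theta_c + m * theta_su),
        "c-": -amp * math.sin(theta_c - m * theta_su),
    }


def two_stage_from_direct(direct):
    """Invert Eqs. (29)-(32) with two additions and two subtractions."""
    return {
        "sc": 0.5 * (direct["s+"] + direct["s-"]),
        "cs": 0.5 * (direct["s+"] - direct["s-"]),
        "cc": 0.5 * (direct["c+"] + direct["c-"]),
        "ss": 0.5 * (direct["c-"] - direct["c+"]),
    }


expected = two_stage_signals(amp=1.3, theta_c=0.4, theta_su=0.25, m=3)
recovered = two_stage_from_direct(
    direct_signals(amp=1.3, theta_c=0.4, theta_su=0.25, m=3)
)
```

Each sideband recording is a sum or difference of two two-stage recordings, so the two-stage inputs can be rebuilt from the sideband pairs per harmonic.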

IV. Coherent Baseband Demodulation 

This section considers the coherent baseband
demodulation of the carrier and subcarrier when the modulation
index ($\Delta$ in Eq. [1]) is such that the received signal
consists of a sinusoidal carrier and square-wave subcarrier; it
also considers the case when the received signal consists
of a suppressed carrier with square-wave subcarrier (i.e.,
$\Delta = 90$ deg). The demodulation functions begin by
reading the recorded signals of Section III from a tape drive
and then processing them to obtain control signals for the
carrier, subcarrier, and suppressed carrier tracking loops.
Recall that two methods for baseband recording of the IF
signal were considered in Section III. It will be shown that
the baseband processing corresponding to these two
methods is very similar, except for a little extra processing that
is required for the direct downconversion scheme, the
second recording scheme in Section III. In this section, the
carrier tracking loop, which is the same for both
recording schemes, is described first. Then, the subcarrier and
suppressed carrier loops for the two-stage downconversion
recording scheme are described, followed by the
additional processing needed for using the signals of the direct
downconversion scheme.

The carrier tracking loop, depicted near the bottom
of Fig. 4, mixes the signals $C_c$ and $C_s$ with $\cos(\hat{\theta}_c)$ and
$\sin(\hat{\theta}_c)$, respectively, before subtracting the mixer outputs
and accumulating the resulting samples over the loop
update interval to form the carrier loop control signal, $V_q$.
Hence,

$$V_q = \frac{1}{M_1}\sum_{M_1}\left[C_c\cos(\hat{\theta}_c) - C_s\sin(\hat{\theta}_c)\right] \qquad (37)$$

where $\hat{\theta}_c$ is an estimate of the recorded carrier phase $\theta_c$.
Substituting Eqs. (13) and (14) for $C_s$ and $C_c$, then
expanding and canceling common terms, and assuming that
the carrier phase error, $\phi_c = \theta_c - \hat{\theta}_c$, varies slowly over
the loop update interval, $T_u = M_1 T_s$, yields

$$V_q = \sqrt{P_c}\,\sin(\phi_c) + n_{V_q} \qquad (38)$$

where

$$n_{V_q} = \frac{1}{M_1}\sum_{M_1}\left[n_{C_c}\cos(\hat{\theta}_c) - n_{C_s}\sin(\hat{\theta}_c)\right] \qquad (39)$$

and $n_{C_s}$ and $n_{C_c}$ are given by Eqs. (15) and (16). Since
both $n_{C_s}$ and $n_{C_c}$ are zero mean with a flat, one-sided
power spectral density level equal to $N_0$ W/Hz within a
recording bandwidth $B_c$ that is much wider than the loop update
rate $1/T_u$, it can be shown that $n_{V_q}$ is Gaussian with
zero mean and variance $N_0/(2T_u)$. Comparing Eq. (38) with
Eq. (6) indicates that both the baseband carrier loop and
the IF carrier loop have the same control signal in terms
of signal power and noise variance. As a result, the carrier
tracking variance of the baseband loop will be the same
as that of a phase-locked loop tuned to the intermediate
frequency.
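
The carrier loop error detector described above can be sketched directly from the recorded samples. This is a noise-free illustration with a constant phase offset over the update interval; the function name and the sample values are assumptions made for the example.

```python
import math


def carrier_control_signal(c_s, c_c, theta_hat):
    """Average of C_c*cos(theta_hat) - C_s*sin(theta_hat) over M1 samples."""
    if len(c_s) != len(c_c) or not c_s:
        raise ValueError("c_s and c_c must be non-empty and of equal length")
    m1 = len(c_s)
    total = sum(
        cc * math.cos(theta_hat) - cs * math.sin(theta_hat)
        for cs, cc in zip(c_s, c_c)
    )
    return total / m1


# Hypothetical recorded samples: signal parts of the recorded carrier, no noise.
p_c, theta_c, theta_hat, m1 = 2.0, 0.50, 0.45, 100
c_s = [math.sqrt(p_c) * math.cos(theta_c)] * m1
c_c = [math.sqrt(p_c) * math.sin(theta_c)] * m1
v_q = carrier_control_signal(c_s, c_c, theta_hat)
```

The result equals sqrt(P_c) sin(theta_c - theta_hat), the same S-curve as a phase-locked loop operating at IF.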

Recall that the square-wave subcarrier was recorded at
baseband by mixing each of its harmonics down to
baseband individually and then recording each harmonic
separately. For each subcarrier harmonic retained, four
baseband signals given by Eqs. (17) through (20) were
actually recorded. As shown in Fig. 4, each harmonic of the
subcarrier is first read from the tape recorder and
demodulated individually, and then properly weighted and
summed to reconstruct the proper inphase and
quadrature samples for the various Costas loops. The
demodulation procedures require that the four recorded signals of
each subcarrier harmonic be multiplied by the estimated
carrier references, $\sin(\hat{\theta}_c)$ and $\cos(\hat{\theta}_c)$, and the estimated
subcarrier harmonic references, $\sin(m\hat{\theta}_{su})$ and $\cos(m\hat{\theta}_{su})$,
where $m$ is an odd integer that depends on the
subcarrier harmonic under consideration. As depicted in Fig. 4,
the resulting signals are either added or subtracted to
retain terms at $\theta_c - \hat{\theta}_c$ and $m(\theta_{su} - \hat{\theta}_{su})$, and eliminate
terms at $\theta_c + \hat{\theta}_c$ and $m(\theta_{su} + \hat{\theta}_{su})$. The details of these
demodulation procedures are provided in Appendix A.
Each of the demodulated signals of a particular
subcarrier harmonic is then properly weighted and summed to
reconstruct the desired control signals for various
tracking loops. For example, in the subcarrier loop depicted
in the top half of Fig. 4, the $m$th harmonic is scaled by
$4/(m\pi)$ on the inphase arm, and $[4/(m\pi)]\sin[(m/2)\pi W_{su}]$
on the quadrature arm ($W_{su}$ is the subcarrier window size).
Subsequently, the scaled inphase signals corresponding to
each harmonic are added to form the input to the inphase
symbol accumulator, whereas the scaled quadrature
signals are added to form the input to the quadrature
symbol accumulator. The product of the inphase
accumulator output, $Y_i$ in Fig. 4, and the quadrature accumulator
output, $Y_{q,su}$ in Fig. 4, is the subcarrier loop control
signal. Given the information in Appendix A, the symbol
accumulator outputs corresponding to the $l$th symbol are
given as
$$Y_i = \sqrt{P_d}\,a_l\cos(\phi_c)\,\frac{8}{\pi^2}\sum_{\substack{j=1\\ j\ \mathrm{odd}}}^{L}\frac{1}{j^2}\cos(j\phi_{su}) + n_{Y_i} \qquad (40)$$

and

$$Y_{q,su} = \sqrt{P_d}\,a_l\cos(\phi_c)\,\frac{8}{\pi^2}\sum_{\substack{j=1\\ j\ \mathrm{odd}}}^{L}\frac{\sin[(j/2)\pi W_{su}]}{j^2}\sin(j\phi_{su}) + n_{Y_{q,su}} \qquad (41)$$

where the independent noise terms, $n_{Y_i}$ and $n_{Y_{q,su}}$, are
shown in Appendix A to have variance $N_0/(2T)$ and
$N_0 W_{su}/(2T)$, respectively, as $L$ approaches infinity in the
above two equations. That is, for sufficiently large $L$, the
noise variances of $n_{Y_i}$ and $n_{Y_{q,su}}$ are approximately the
same as the variances of $n_{Z_i}$ and $n_{Z_{q,su}}$ in Eqs. (7) and (8),
respectively. Furthermore, the signal parts of Eqs. (40)
and (7), and of Eqs. (41) and (8), are also equal as $L$
approaches infinity. This can be easily shown by using the
Fourier series expansion of $F_I(\phi_{su})$ and $F_Q(\phi_{su})$ in Eqs. (7)
and (8), given as follows:

$$F_I(\phi_{su}) = \frac{8}{\pi^2}\sum_{\substack{j=1\\ j\ \mathrm{odd}}}^{\infty}\frac{1}{j^2}\cos(j\phi_{su}) \qquad (42)$$

and

$$F_Q(\phi_{su}) = \frac{8}{\pi^2}\sum_{\substack{j=1\\ j\ \mathrm{odd}}}^{\infty}\frac{\sin[(j/2)\pi W_{su}]}{j^2}\sin(j\phi_{su}) \qquad (43)$$
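
The convergence claimed above can be illustrated by comparing the closed forms of Eqs. (9) and (10) with Fourier sums truncated at harmonic L. The sketch below is an illustration only; the function names are hypothetical, and the comparison points and window width are arbitrary choices.

```python
import math


def f_i(phi_su):
    """Closed-form inphase function on [-pi, pi], Eq. (9)."""
    return 1.0 - (2.0 / math.pi) * abs(phi_su)


def f_q(phi_su, w_su):
    """Closed-form windowed quadrature function on [-pi, pi], Eq. (10)."""
    magnitude = abs(phi_su)
    sign = math.copysign(1.0, phi_su)
    if magnitude <= 0.5 * math.pi * w_su:
        return (2.0 / math.pi) * phi_su
    if magnitude <= math.pi * (1.0 - 0.5 * w_su):
        return sign * w_su
    return 2.0 * sign - (2.0 / math.pi) * phi_su


def f_i_truncated(phi_su, highest_harmonic):
    """Fourier sum for F_I using odd harmonics up to L."""
    total = sum(
        math.cos(j * phi_su) / j ** 2
        for j in range(1, highest_harmonic + 1, 2)
    )
    return (8.0 / math.pi ** 2) * total


def f_q_truncated(phi_su, w_su, highest_harmonic):
    """Fourier sum for F_Q using odd harmonics up to L."""
    total = sum(
        math.sin(0.5 * j * math.pi * w_su) * math.sin(j * phi_su) / j ** 2
        for j in range(1, highest_harmonic + 1, 2)
    )
    return (8.0 / math.pi ** 2) * total


if __name__ == "__main__":
    for L in (1, 3, 5, 51):
        print(L, f_i_truncated(0.3, L), f_q_truncated(0.3, 0.5, L))
    print("closed form", f_i(0.3), f_q(0.3, 0.5))
```

Since the series coefficients decay as 1/j², a handful of harmonics already gives a close approximation away from the corners of the closed-form curves.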


As a result of the subcarrier loop accumulator outputs of
the ARX II and BTD being approximately equal for
sufficiently large $L$, the control signals, $Y_i Y_{q,su}$ and $Z_i Z_{q,su}$,
and tracking variance of both loops can also be shown to
be equal.

For suppressed carrier tracking, the suppressed carrier
loop, also shown in Fig. 4, is a Costas loop sharing the
same inphase arm with the subcarrier Costas loop. Hence,
the suppressed carrier inphase accumulator output, $Y_i$, is
as given in Eq. (40). The processing in the suppressed
carrier loop quadrature arm is very similar to that of the
subcarrier loop. Hence, following the same derivation as in
the subcarrier loop quadrature arm, the suppressed carrier
quadrature arm accumulation over the $l$th symbol can be
shown to be

$$Y_{q,sca} = \sqrt{P_d}\,a_l\sin(\phi_c)\,\frac{8}{\pi^2}\sum_{\substack{j=1\\ j\ \mathrm{odd}}}^{L}\frac{1}{j^2}\cos(j\phi_{su}) + n_{Y_{q,sca}} \qquad (44)$$

As before, the equivalence of the suppressed carrier loops
of the BTD and ARX II is established by comparing the
inphase and quadrature accumulator outputs of both loops.
The equivalence of the inphase arms has already been
shown earlier. The signal parts of the quadrature arm
accumulations given by Eqs. (12) and (44) are seen to be
equivalent after letting $L$ go to infinity in Eq. (44) and
then using Eq. (42) for the infinite sum. The noise parts
of Eqs. (12) and (44), as $L$ approaches infinity in Eq. (44),
can also be shown to have the same variance, $N_0/(2T)$.
Furthermore, it can be shown that $n_{Y_i}$ and $n_{Y_{q,sca}}$ are
uncorrelated.

Finally, consider the coherent baseband demodulation
of the signals that were recorded as a result of the
direct downconversion scheme, the second recording scheme
described in Section III. As indicated earlier, the
demodulator for the direct scheme is very similar to the
two-stage downconversion scheme demodulator except that the
former requires a little extra processing. Specifically, the
recorded signals for the direct scheme, given by Eqs. (25)-(28),
are combined to yield the signals $SU_{ss}^{(m)}$, $SU_{sc}^{(m)}$,
$SU_{cs}^{(m)}$, and $SU_{cc}^{(m)}$, which were the input signals to the
two-stage demodulator described in this section. From
Eqs. (29)-(32), it is clear that the input signals to the
two-stage demodulator can be generated by using two adders
and two subtractors (per harmonic) on the signals $SU_{s+}^{(m)}$,
$SU_{s-}^{(m)}$, $SU_{c+}^{(m)}$, and $SU_{c-}^{(m)}$. After these additions and
subtractions, shown in Fig. 5, the rest of the signal processing
remains the same as in the previous scheme.


V. Requirements for Software 
Implementation of the BTD 

As indicated earlier, the BTD depicted in Fig. 4 is
targeted for implementation on a general-purpose
engineering workstation. The primary concern with a software
implementation is whether a general-purpose workstation
can provide the computational speed required for real-time
processing of the recorded signals. For example, if the
received symbol rate is 1K symbols/sec and the A/D
sampling rate is 10K samples/sec, then a real-time
demodulator would process 10K samples/sec and detect 1K
symbols/sec. This section first determines the computational
speed required (in operations/sec) for implementing the
BTD in real time when the symbol rate is 1K symbols/sec
and the sample rate is 10K samples/sec. Next, it
determines the feasibility of implementing the BTD on Sun
Sparc 2 (SS-2) and Sun Sparc 10 workstations.

Recall that the two modes for the BTD are the residual
carrier with subcarrier mode ($\Delta \ne 90$ deg) and the
suppressed carrier with subcarrier mode ($\Delta = 90$ deg). The
number of (addition and multiplication) operations/sec,
for a given mode, can be computed by separately counting
the operations/sample and operations/symbol in Fig. 4
and then multiplying these counts by the sampling
rate and the symbol rate, respectively. Mathematically,

$$\frac{\text{operations}}{\text{sec}} = \left(\frac{\text{number of ``$+$'' and ``$\times$'' operations}}{\text{sample}}\right)\left(\frac{\text{10K samples}}{\text{sec}}\right) + \left(\frac{\text{number of ``$+$'' and ``$\times$'' operations}}{\text{symbol}}\right)\left(\frac{\text{1K symbols}}{\text{sec}}\right) \qquad (45)$$

where the sample and symbol (floating point) operations
for the coherent demodulator in Fig. 4 are tabulated in
Table 2(a). The total number of operations in Table 2(a)
is accounted for in more detail in Table 2(b). The coherent
demodulator for the direct downconversion scheme needs
four more sample operations per harmonic than that of
the two-stage downconversion scheme, as shown in Fig. 5,
and is not considered separately. Note that addition
and multiplication are counted as one operation each.
Furthermore, the lookup table operations are done at the
10K samples/sec rate, since the lookup table is
considered to be part of the carrier and subcarrier numerically
controlled oscillators (NCO’s). With the number of
harmonics, n, equal to three (i.e., the first, third, and fifth
harmonics are recorded) in Table 2(a), there are 71
operations/sample and 38 operations/symbol for the residual
carrier mode, and there are 98 operations/sample and 39
operations/symbol in the suppressed carrier mode.
Consequently, at 1K symbols/sec and a 10-kHz sampling rate, a
real-time processor would be required to provide 748K and
1019K operations/sec for the residual carrier and suppressed
carrier modes, respectively. The number of operations per
second and the SNR degradation as a function of
subcarrier harmonics are tabulated in Table 2(b). Table 3 lists
the throughput utilized by a software implementation of
the BTD for various computers. The throughput is defined
as the number of instructions a computer can execute per
second. In Table 3, the percentage of throughput utilized
by a Sun SS-2 was obtained by simulating a portion of
the BTD on the Sun SS-2 and then linearly extrapolating
the results to determine the throughput utilization for a
software implementation of the entire BTD. The throughput
utilization of the remaining computers was obtained by linearly
extrapolating the Sun SS-2 results using the SPECfp92
benchmark. The specmark rating of a computer, obtained
by running a collection of commonly used programs on
a target system, is a measure of the computer’s
performance relative to a VAX-11/780. For example, the Sun
SS-2 is expected to perform 22.8 times better than a
VAX-11/780, since it specmarks at 22.8. Alternatively stated,
the throughput utilization of the Sun SS-2 is expected to
be (1/22.8) that of a VAX-11/780. It is clear from Table 3
that several computers will be able to process the recorded
data in real time.
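
The operation count above is simple enough to reproduce directly. The sketch below evaluates Eq. (45) with the per-sample and per-symbol counts quoted in the text for three recorded harmonics; the function name and default arguments are illustrative.

```python
def operations_per_second(ops_per_sample, ops_per_symbol,
                          sample_rate=10_000, symbol_rate=1_000):
    """Additions plus multiplications needed per second of recorded signal."""
    return ops_per_sample * sample_rate + ops_per_symbol * symbol_rate


# Counts quoted in the text for n = 3 harmonics.
residual_carrier = operations_per_second(ops_per_sample=71, ops_per_symbol=38)
suppressed_carrier = operations_per_second(ops_per_sample=98, ops_per_symbol=39)
```

At these rates the per-sample work dominates the total, so the sampling rate and the number of recorded harmonics are the main levers on the required throughput.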
Glossary

$\Psi_c$ = Incoming carrier phase
$\hat{\Psi}_c$ = Predicted carrier phase
$\theta_c$ = Carrier phase offset, $\Psi_c - \hat{\Psi}_c$
$\hat{\theta}_c$ = Carrier phase offset estimate
$\phi_c$ = Carrier phase offset error, $\theta_c - \hat{\theta}_c$
$\Psi_{su}$ = Incoming subcarrier phase
$\hat{\Psi}_{su}$ = Predicted subcarrier phase
$\theta_{su}$ = Subcarrier phase offset, $\Psi_{su} - \hat{\Psi}_{su}$
$\hat{\theta}_{su}$ = Subcarrier phase offset estimate
$\phi_{su}$ = Subcarrier phase offset error, $\theta_{su} - \hat{\theta}_{su}$
$XSU_{ss}^{(m)}$ = $m$th subcarrier harmonic, derived from mixing the received signal with $\sqrt{2}\sin(\hat{\Psi}_c)\sin(m\hat{\Psi}_{su})$
$SU_{ss}^{(m)}$ = The lowpass filtered version of $XSU_{ss}^{(m)}$
$XSU_{sc}^{(m)}$ = $m$th subcarrier harmonic, derived from mixing the received signal with $\sqrt{2}\sin(\hat{\Psi}_c)\cos(m\hat{\Psi}_{su})$
$SU_{sc}^{(m)}$ = The lowpass filtered version of $XSU_{sc}^{(m)}$
$XSU_{cs}^{(m)}$ = $m$th subcarrier harmonic, derived from mixing the received signal with $\sqrt{2}\cos(\hat{\Psi}_c)\sin(m\hat{\Psi}_{su})$
$SU_{cs}^{(m)}$ = The lowpass filtered version of $XSU_{cs}^{(m)}$
$XSU_{cc}^{(m)}$ = $m$th subcarrier harmonic, derived from mixing the received signal with $\sqrt{2}\cos(\hat{\Psi}_c)\cos(m\hat{\Psi}_{su})$
$SU_{cc}^{(m)}$ = The lowpass filtered version of $XSU_{cc}^{(m)}$
$XSU_{s+}^{(m)}$ = $m$th subcarrier harmonic, derived from mixing the received signal with $\sqrt{2}\sin(\hat{\Psi}_c + m\hat{\Psi}_{su})$
$SU_{s+}^{(m)}$ = The lowpass filtered version of $XSU_{s+}^{(m)}$
$XSU_{s-}^{(m)}$ = $m$th subcarrier harmonic, derived from mixing the received signal with $\sqrt{2}\sin(\hat{\Psi}_c - m\hat{\Psi}_{su})$
$SU_{s-}^{(m)}$ = The lowpass filtered version of $XSU_{s-}^{(m)}$
$XSU_{c+}^{(m)}$ = $m$th subcarrier harmonic, derived from mixing the received signal with $\sqrt{2}\cos(\hat{\Psi}_c + m\hat{\Psi}_{su})$
$SU_{c+}^{(m)}$ = The lowpass filtered version of $XSU_{c+}^{(m)}$
$XSU_{c-}^{(m)}$ = $m$th subcarrier harmonic, derived from mixing the received signal with $\sqrt{2}\cos(\hat{\Psi}_c - m\hat{\Psi}_{su})$
$SU_{c-}^{(m)}$ = The lowpass filtered version of $XSU_{c-}^{(m)}$
Appendix A

I. Derivation of Eqs. (13) and (14)

As shown in Fig. 2, the lowpass filter inputs $XC_s$ and $XC_c$ are given as

$$XC_s = r\sqrt{2}\sin(\hat{\Psi}_c) \tag{A-1}$$

$$XC_c = r\sqrt{2}\cos(\hat{\Psi}_c) \tag{A-2}$$

where the digitized IF, $r$, is given by Eq. (4) and $\hat{\Psi}_c$ is the predicted carrier phase. (The index $kT_s$ is omitted for simplicity of notation.) Substituting Eq. (4) into Eqs. (A-1) and (A-2), then expanding and letting $\Theta_c = \Psi_c - \hat{\Psi}_c$ yields

$$XC_s = \sqrt{P_c}\cos(\Theta_c) - n_1\sin(\Theta_c) - n_2\cos(\Theta_c) - \sqrt{P_d}\,d\left(\frac{4}{\pi}\sum_{\substack{j=1 \\ j:\,\text{odd}}}^{L}\frac{1}{j}\sin(j\Psi_{su})\right)\sin(\Theta_c) + \left(\text{terms at angular frequency } (\Psi_c + \hat{\Psi}_c)'\right) \tag{A-3}$$

and

$$XC_c = \sqrt{P_c}\sin(\Theta_c) + n_1\cos(\Theta_c) - n_2\sin(\Theta_c) + \sqrt{P_d}\,d\left(\frac{4}{\pi}\sum_{\substack{j=1 \\ j:\,\text{odd}}}^{L}\frac{1}{j}\sin(j\Psi_{su})\right)\cos(\Theta_c) + \left(\text{terms at angular frequency } (\Psi_c + \hat{\Psi}_c)'\right) \tag{A-4}$$

where the instantaneous angular frequency (in radians/sec) of the high-frequency components is denoted as $(\Psi_c + \hat{\Psi}_c)'$, the time derivative of the total phase $\Psi_c + \hat{\Psi}_c$.

From Eqs. (A-3) and (A-4), it is clear that $XC_s$ and $XC_c$ have the desired baseband carrier at angular frequency $\Theta_c'$, as well as the undesired data spectra centered at $\Theta_c' \pm j\Psi_{su}'$, where $\Psi_{su}'$ is the angular subcarrier frequency and $j$ is an odd integer, $1 \le j \le L$, and other undesired terms at frequency $(\Psi_c + \hat{\Psi}_c)'$. The recorded baseband carrier given by Eqs. (13) and (14) can be obtained by lowpass filtering $XC_s$ and $XC_c$ with the filter $H_c(z)$ whose bandwidth $B_c$ is small enough to accommodate only the desired baseband carrier, i.e.,

$$B_c < \frac{-\Theta_c' + \Psi_{su}' - \omega_d}{2\pi} \quad \text{(in Hz)}$$

where $\omega_d$ is the bandwidth (in radians/sec) wherein most of the data power resides.
II. Derivation of Eqs. (17) Through (20)

As shown in Fig. 2, recording of the mth harmonic of the data-modulated subcarrier requires downconverting this particular harmonic to baseband by mixing $XC_s$ and $XC_c$ with both the inphase and quadrature subcarrier references tuned to the predicted frequency of the mth harmonic. Here, $m$ is an odd integer with $1 \le m \le L$, where $L$ is the highest subcarrier harmonic to be recorded. The four signals that are generated as a result are as follows:

$$XSU_{ss}^{(m)} = XC_s\sin(m\hat{\Psi}_{su}) \tag{A-5}$$

$$XSU_{sc}^{(m)} = XC_s\cos(m\hat{\Psi}_{su}) \tag{A-6}$$

$$XSU_{cs}^{(m)} = XC_c\sin(m\hat{\Psi}_{su}) \tag{A-7}$$

$$XSU_{cc}^{(m)} = XC_c\cos(m\hat{\Psi}_{su}) \tag{A-8}$$

It can be found that (with $\Theta_{su} = \Psi_{su} - \hat{\Psi}_{su}$)

$$XSU_{ss}^{(m)} = \sqrt{P_c}\cos(\Theta_c)\sin(m\hat{\Psi}_{su}) - \sqrt{P_d}\,d\,\frac{2}{m\pi}\sin(\Theta_c)\cos(m\Theta_{su}) - \sqrt{P_d}\,\frac{2d}{\pi}\sin(\Theta_c)\left(\sum_{\substack{j=1 \\ j:\,\text{odd},\ j\ne m}}^{L}\frac{1}{j}\cos(j\Psi_{su} - m\hat{\Psi}_{su})\right) - \left[n_1\sin(\Theta_c) + n_2\cos(\Theta_c)\right]\sin(m\hat{\Psi}_{su}) + \left(\text{terms at angular frequency } (\Psi_c + \hat{\Psi}_c)'\right) \tag{A-9}$$

$$XSU_{sc}^{(m)} = \sqrt{P_c}\cos(\Theta_c)\cos(m\hat{\Psi}_{su}) - \sqrt{P_d}\,d\,\frac{2}{m\pi}\sin(\Theta_c)\sin(m\Theta_{su}) - \sqrt{P_d}\,\frac{2d}{\pi}\sin(\Theta_c)\left(\sum_{\substack{j=1 \\ j:\,\text{odd},\ j\ne m}}^{L}\frac{1}{j}\sin(j\Psi_{su} - m\hat{\Psi}_{su})\right) - \left[n_1\sin(\Theta_c) + n_2\cos(\Theta_c)\right]\cos(m\hat{\Psi}_{su}) + \left(\text{terms at angular frequency } (\Psi_c + \hat{\Psi}_c)'\right) \tag{A-10}$$

$$XSU_{cs}^{(m)} = \sqrt{P_c}\sin(\Theta_c)\sin(m\hat{\Psi}_{su}) + \sqrt{P_d}\,d\,\frac{2}{m\pi}\cos(\Theta_c)\cos(m\Theta_{su}) + \sqrt{P_d}\,\frac{2d}{\pi}\cos(\Theta_c)\left(\sum_{\substack{j=1 \\ j:\,\text{odd},\ j\ne m}}^{L}\frac{1}{j}\cos(j\Psi_{su} - m\hat{\Psi}_{su})\right) + \left[n_1\cos(\Theta_c) - n_2\sin(\Theta_c)\right]\sin(m\hat{\Psi}_{su}) + \left(\text{terms at angular frequency } (\Psi_c + \hat{\Psi}_c)'\right) \tag{A-11}$$

$$XSU_{cc}^{(m)} = \sqrt{P_c}\sin(\Theta_c)\cos(m\hat{\Psi}_{su}) + \sqrt{P_d}\,d\,\frac{2}{m\pi}\cos(\Theta_c)\sin(m\Theta_{su}) + \sqrt{P_d}\,\frac{2d}{\pi}\cos(\Theta_c)\left(\sum_{\substack{j=1 \\ j:\,\text{odd},\ j\ne m}}^{L}\frac{1}{j}\sin(j\Psi_{su} - m\hat{\Psi}_{su})\right) + \left[n_1\cos(\Theta_c) - n_2\sin(\Theta_c)\right]\cos(m\hat{\Psi}_{su}) + \left(\text{terms at angular frequency } (\Psi_c + \hat{\Psi}_c)'\right) \tag{A-12}$$

It is clear that the baseband signal component in each of $XSU_{ss}^{(m)}$, $XSU_{sc}^{(m)}$, $XSU_{cs}^{(m)}$, and $XSU_{cc}^{(m)}$ is represented by the second term in each of the above four equations, since the subcarrier frequency error, denoted by $\Theta_{su}'$, is assumed to be very close to zero. The recorded mth subcarrier harmonic signals given in Eqs. (17) to (20) for any odd integer $1 \le m \le L$ are the corresponding baseband components in Eqs. (A-9) to (A-12). These signals can be extracted by lowpass filtering $XSU_{ss}^{(m)}$, $XSU_{sc}^{(m)}$, $XSU_{cs}^{(m)}$, and $XSU_{cc}^{(m)}$ with the filter $H_{su}(z)$, of which the bandwidth $B_{su}$ is wide enough to accommodate both the carrier frequency offset and the maximum subcarrier frequency offsets (up to the highest subcarrier harmonic recorded) as well as the data modulation, namely,

$$B_{su} > \frac{\Theta_c' + L\Theta_{su}' + \omega_d}{2\pi} \quad \text{(in Hz)}$$

where $\omega_d$ is the bandwidth (in radians/sec) wherein most of the data power resides.

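III. Derivation of Eqs. (40) and (41)

In referring to the subcarrier loop in Fig. 4, it is seen that the mth subcarrier harmonic waveforms $SU_{ss}^{(m)}$, $SU_{sc}^{(m)}$, $SU_{cs}^{(m)}$, and $SU_{cc}^{(m)}$ are demodulated as follows:

$$A^{(m)} = SU_{cs}^{(m)}\cos(\hat{\Theta}_c) - SU_{ss}^{(m)}\sin(\hat{\Theta}_c) \tag{A-13}$$

$$B^{(m)} = SU_{cc}^{(m)}\cos(\hat{\Theta}_c) - SU_{sc}^{(m)}\sin(\hat{\Theta}_c) \tag{A-14}$$

where the references $\sin(\hat{\Theta}_c)$ and $\cos(\hat{\Theta}_c)$ have the estimated carrier error phase $\hat{\Theta}_c$ obtained from the carrier tracking. It can be found that

$$A^{(m)} = \sqrt{P_d}\,d\,\frac{2}{m\pi}\cos(m\Theta_{su})\cos(\phi_c) + n_A^{(m)} \tag{A-15}$$

$$B^{(m)} = \sqrt{P_d}\,d\,\frac{2}{m\pi}\sin(m\Theta_{su})\cos(\phi_c) + n_B^{(m)} \tag{A-16}$$

where

$$n_A^{(m)} = n_{SU_{cs}}^{(m)}\cos(\hat{\Theta}_c) - n_{SU_{ss}}^{(m)}\sin(\hat{\Theta}_c) \tag{A-17}$$

$$n_B^{(m)} = n_{SU_{cc}}^{(m)}\cos(\hat{\Theta}_c) - n_{SU_{sc}}^{(m)}\sin(\hat{\Theta}_c) \tag{A-18}$$

It is clear that both $n_A^{(m)}$ and $n_B^{(m)}$ have a flat, one-sided PSD level equal to $N_0/2$ W/Hz within the frequency interval $(0, B_{su}]$ Hz, since $n_{SU_{ss}}^{(m)}$, $n_{SU_{sc}}^{(m)}$, $n_{SU_{cs}}^{(m)}$, and $n_{SU_{cc}}^{(m)}$ have the same PSD level ($N_0/2$, one-sided) within the same bandwidth ($B_{su}$). Note that $B_{su}$ is the bandwidth of the filter $H_{su}(z)$ in Fig. 2. Furthermore, it can be shown that both $n_A^{(m)}$ and $n_B^{(m)}$ are not only independent of each other, but also independent of their counterparts at other subcarrier harmonics.

The signals $A^{(m)}$ and $B^{(m)}$ are multiplied by the subcarrier references and then added or subtracted, as shown in Fig. 4, to remove the subcarrier frequency error. The resulting signals, $X_i^{(m)}$ and $X_{q,su}^{(m)}$, are given as follows:

$$X_i^{(m)} = A^{(m)}\cos(m\hat{\Theta}_{su}) + B^{(m)}\sin(m\hat{\Theta}_{su}) \tag{A-19}$$

$$X_{q,su}^{(m)} = B^{(m)}\cos(m\hat{\Theta}_{su}) - A^{(m)}\sin(m\hat{\Theta}_{su}) \tag{A-20}$$

Substituting the expressions for $A^{(m)}$ and $B^{(m)}$, then letting $\phi_c = \Theta_c - \hat{\Theta}_c$ and $\phi_{su} = \Theta_{su} - \hat{\Theta}_{su}$, yields

$$X_i^{(m)} = \sqrt{P_d}\,d\,\frac{2}{m\pi}\cos(m\phi_{su})\cos(\phi_c) + n_{X_i}^{(m)} \tag{A-21}$$

$$X_{q,su}^{(m)} = \sqrt{P_d}\,d\,\frac{2}{m\pi}\sin(m\phi_{su})\cos(\phi_c) + n_{X_{q,su}}^{(m)} \tag{A-22}$$

where

$$n_{X_i}^{(m)} = n_A^{(m)}\cos(m\hat{\Theta}_{su}) + n_B^{(m)}\sin(m\hat{\Theta}_{su}) \tag{A-23}$$

$$n_{X_{q,su}}^{(m)} = n_B^{(m)}\cos(m\hat{\Theta}_{su}) - n_A^{(m)}\sin(m\hat{\Theta}_{su}) \tag{A-24}$$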
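
The demodulation steps of Eqs. (A-13) through (A-22) can be checked numerically for the noise-free case. The sketch below is an illustration only: it assumes the recorded harmonics are the second (baseband) terms of Eqs. (A-9) to (A-12) with the noise set to zero, and the function names `recorded_harmonics` and `subcarrier_loop_arms` are hypothetical.

```python
import math


def recorded_harmonics(m, p_d, d, theta_c, theta_su):
    """Noise-free baseband terms of Eqs. (A-9) to (A-12) for harmonic m."""
    k = math.sqrt(p_d) * d * 2.0 / (m * math.pi)
    su_ss = -k * math.sin(theta_c) * math.cos(m * theta_su)
    su_sc = -k * math.sin(theta_c) * math.sin(m * theta_su)
    su_cs = k * math.cos(theta_c) * math.cos(m * theta_su)
    su_cc = k * math.cos(theta_c) * math.sin(m * theta_su)
    return su_ss, su_sc, su_cs, su_cc


def subcarrier_loop_arms(m, su, theta_c_hat, theta_su_hat):
    """Apply Eqs. (A-13), (A-14), (A-19), and (A-20) to one harmonic."""
    su_ss, su_sc, su_cs, su_cc = su
    # Carrier demodulation with the estimated carrier phase offset.
    a = su_cs * math.cos(theta_c_hat) - su_ss * math.sin(theta_c_hat)
    b = su_cc * math.cos(theta_c_hat) - su_sc * math.sin(theta_c_hat)
    # Subcarrier demodulation with the estimated subcarrier phase offset.
    x_i = a * math.cos(m * theta_su_hat) + b * math.sin(m * theta_su_hat)
    x_q = b * math.cos(m * theta_su_hat) - a * math.sin(m * theta_su_hat)
    return x_i, x_q


# Arbitrary illustrative values (not taken from the article).
m, p_d, d = 3, 2.0, -1.0
theta_c, theta_c_hat = 0.7, 0.5
theta_su, theta_su_hat = 0.30, 0.25

su = recorded_harmonics(m, p_d, d, theta_c, theta_su)
x_i, x_q = subcarrier_loop_arms(m, su, theta_c_hat, theta_su_hat)

# Closed forms of Eqs. (A-21) and (A-22) without the noise terms.
k = math.sqrt(p_d) * d * 2.0 / (m * math.pi)
phi_c = theta_c - theta_c_hat
phi_su = theta_su - theta_su_hat
print(x_i, k * math.cos(m * phi_su) * math.cos(phi_c))
print(x_q, k * math.sin(m * phi_su) * math.cos(phi_c))
```

Each printed pair should agree to within floating-point rounding, which confirms that only the phase errors $\phi_c$ and $\phi_{su}$ remain after the two demodulation stages.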

In referring to the weighted summer in Fig. 4, it is seen that the mth subcarrier harmonic $X_i^{(m)}$ is weighted by $4/(m\pi)$ and then added to the other inphase harmonics to obtain the input to the inphase symbol accumulator, denoted as $X_i$. Hence,

$$X_i = \sum_{\substack{m=1 \\ m:\,\text{odd}}}^{L} X_i^{(m)}\,\frac{4}{m\pi} = \sqrt{P_d}\,d\cos(\phi_c)\,\frac{8}{\pi^2}\sum_{\substack{m=1 \\ m:\,\text{odd}}}^{L}\frac{\cos(m\phi_{su})}{m^2} + n_{X_i} \tag{A-25}$$

where

$$n_{X_i} = \frac{4}{\pi}\sum_{\substack{m=1 \\ m:\,\text{odd}}}^{L}\frac{n_A^{(m)}\cos(m\hat{\Theta}_{su}) + n_B^{(m)}\sin(m\hat{\Theta}_{su})}{m} \tag{A-26}$$

Similarly, the input to the quadrature symbol accumulator, $X_{q,su}$, is obtained by weighting the mth subcarrier harmonic $X_{q,su}^{(m)}$ by $[4/(m\pi)]\sin[(m/2)\pi W_{su}]$ and then adding it to the other quadrature harmonics. Namely,

$$X_{q,su} = \sum_{\substack{m=1 \\ m:\,\text{odd}}}^{L} X_{q,su}^{(m)}\,\frac{4}{m\pi}\sin\left(\frac{m}{2}\pi W_{su}\right) = \sqrt{P_d}\,d\cos(\phi_c)\,\frac{8}{\pi^2}\sum_{\substack{m=1 \\ m:\,\text{odd}}}^{L}\sin\left(\frac{m}{2}\pi W_{su}\right)\frac{\sin(m\phi_{su})}{m^2} + n_{X_{q,su}} \tag{A-27}$$

where

$$n_{X_{q,su}} = \frac{4}{\pi}\sum_{\substack{m=1 \\ m:\,\text{odd}}}^{L}\sin\left(\frac{m}{2}\pi W_{su}\right)\frac{n_B^{(m)}\cos(m\hat{\Theta}_{su}) - n_A^{(m)}\sin(m\hat{\Theta}_{su})}{m} \tag{A-28}$$

The variance of $n_{X_i}$ in Eq. (A-26) can be found as

$$\sigma_{n_{X_i}}^2 = E\left\{\left(\frac{4}{\pi}\sum_{\substack{m=1 \\ m:\,\text{odd}}}^{L}\frac{n_A^{(m)}\cos(m\hat{\Theta}_{su}) + n_B^{(m)}\sin(m\hat{\Theta}_{su})}{m}\right)^2\right\} = \left(\frac{4}{\pi}\right)^2\sum_{\substack{m=1 \\ m:\,\text{odd}}}^{L}\frac{E\{(n_A^{(m)})^2\}\cos^2(m\hat{\Theta}_{su}) + E\{(n_B^{(m)})^2\}\sin^2(m\hat{\Theta}_{su})}{m^2} \tag{A-29}$$

where

$$E\{(n_A^{(m)})^2\} = E\{(n_B^{(m)})^2\} = \frac{N_0}{2}B_{su}$$

Here, $B_{su}$ is the bandwidth of the filter $H_{su}(z)$ in Fig. 2. The last equality in Eq. (A-29) holds because $n_A^{(m)}$ and $n_B^{(m)}$ are mutually independent with respect to each other as well as their counterparts at other subcarrier harmonics. After letting the highest recorded subcarrier harmonic order $L$ go to infinity, and then using the identity

$$\sum_{\substack{m=1 \\ m:\,\text{odd}}}^{\infty}\frac{1}{m^2} = \frac{\pi^2}{8}$$

it is easily shown that the variance of $n_{X_i}$ approaches twice the variance of either $n_A^{(m)}$ or $n_B^{(m)}$. Therefore, $n_{X_i}$ has a flat, one-sided PSD level equal to $N_0$ W/Hz, which is twice the PSD level of $n_A^{(m)}$ or $n_B^{(m)}$ within $B_{su}$ Hz. Similarly, it can also be shown that $n_{X_{q,su}}$ in Eq. (A-28) has a flat, one-sided power spectral density level approximately equal to $(W_{su}N_0)$ W/Hz within the lowpass arm filter bandwidth $B_{su}$ Hz. This result is as expected since windowing the quadrature subcarrier reference reduces the noise power by the factor $W_{su}$.

In referring to Fig. 4, it is seen that the subcarrier loop inphase and quadrature accumulator outputs, $Y_i$ and $Y_{q,su}$, are obtained by averaging samples of $X_i$ and $X_{q,su}$ over a symbol duration. Assuming that $\phi_c$ and $m\phi_{su}$ for all $m$ are approximately constant over a symbol duration, $Y_i$ and $Y_{q,su}$ are as given by Eqs. (40) and (41). It is also true that, since the one-sided PSD levels of $n_{X_i}$ ($N_0$ W/Hz) and $n_{X_{q,su}}$ ($[W_{su}N_0]$ W/Hz) are flat over $B_{su}$ Hz and since $B_{su}$ is much greater than the symbol rate ($1/T$), the variance of $n_{Y_i}$ and the variance of $n_{Y_{q,su}}$ are equal to $N_0/(2T)$ and $W_{su}N_0/(2T)$, respectively. Furthermore, since

$$\left\{n_A^{(m)},\ n_B^{(m)} \;\middle|\; \text{odd integer } m,\ 1 \le m \le L\right\}$$

is a set of mutually independent random samples, it is straightforward to show that $n_{X_i}$ and $n_{X_{q,su}}$ are independent, which in turn assures that $n_{Y_i}$ and $n_{Y_{q,su}}$ are also independent.

IV. Derivation of Eq. (44)

In referring to the suppressed carrier loop in Fig. 4, it is seen that the recorded mth subcarrier harmonic waveforms $SU_{ss}^{(m)}$, $SU_{sc}^{(m)}$, $SU_{cs}^{(m)}$, and $SU_{cc}^{(m)}$ are demodulated as follows:

$$C^{(m)} = SU_{ss}^{(m)}\cos(\hat{\Theta}_c) + SU_{cs}^{(m)}\sin(\hat{\Theta}_c) \tag{A-30}$$

$$D^{(m)} = SU_{sc}^{(m)}\cos(\hat{\Theta}_c) + SU_{cc}^{(m)}\sin(\hat{\Theta}_c) \tag{A-31}$$

where $\hat{\Theta}_c$ represents the estimated carrier error phase. After substituting Eqs. (17) to (20) into the above equations, it can be found that

$$C^{(m)} = -\sqrt{P_d}\,d\,\frac{2}{m\pi}\cos(m\Theta_{su})\sin(\phi_c) + n_C^{(m)} \tag{A-32}$$

$$D^{(m)} = -\sqrt{P_d}\,d\,\frac{2}{m\pi}\sin(m\Theta_{su})\sin(\phi_c) + n_D^{(m)} \tag{A-33}$$

where

$$n_C^{(m)} = n_{SU_{ss}}^{(m)}\cos(\hat{\Theta}_c) + n_{SU_{cs}}^{(m)}\sin(\hat{\Theta}_c) \tag{A-34}$$

$$n_D^{(m)} = n_{SU_{sc}}^{(m)}\cos(\hat{\Theta}_c) + n_{SU_{cc}}^{(m)}\sin(\hat{\Theta}_c) \tag{A-35}$$

and $n_{SU_{ss}}^{(m)}$, $n_{SU_{sc}}^{(m)}$, $n_{SU_{cs}}^{(m)}$, and $n_{SU_{cc}}^{(m)}$ are as given by Eqs. (21) to (24).

It is clear that both $n_C^{(m)}$ and $n_D^{(m)}$ have a flat, one-sided PSD level equal to $N_0/2$ W/Hz within the frequency interval $[0, B_{su}]$ Hz, since $n_{SU_{ss}}^{(m)}$, $n_{SU_{sc}}^{(m)}$, $n_{SU_{cs}}^{(m)}$, and $n_{SU_{cc}}^{(m)}$ have the PSD level equal to $N_0/2$ W/Hz within the same band. Note that $B_{su}$ is the bandwidth of the filter $H_{su}(z)$ in Fig. 2. It is easy to show that both $n_C^{(m)}$ and $n_D^{(m)}$ are not only independent of each other, but also independent of their counterparts at other subcarrier harmonics. Moreover, it can be shown that

$$\left\{n_A^{(m)},\ n_B^{(m)},\ n_C^{(m)},\ n_D^{(m)} \;\middle|\; \text{odd integer } m,\ 1 \le m \le L\right\} \tag{A-36}$$

is a set of mutually independent random samples.

The signals $C^{(m)}$ and $D^{(m)}$ are multiplied by the subcarrier references and then added to remove the subcarrier frequency error. The resulting signal $X_{q,sca}^{(m)}$ is as follows:

$$X_{q,sca}^{(m)} = C^{(m)}\cos(m\hat{\Theta}_{su}) + D^{(m)}\sin(m\hat{\Theta}_{su}) \tag{A-37}$$

$$X_{q,sca}^{(m)} = -\sqrt{P_d}\,d\,\frac{2}{m\pi}\cos(m\phi_{su})\sin(\phi_c) + n_{X_{q,sca}}^{(m)} \tag{A-38}$$

where

$$n_{X_{q,sca}}^{(m)} = n_C^{(m)}\cos(m\hat{\Theta}_{su}) + n_D^{(m)}\sin(m\hat{\Theta}_{su}) \tag{A-39}$$

In a manner similar to forming the weighted sums in the subcarrier tracking, the mth subcarrier harmonic, $X_{q,sca}^{(m)}$, is weighted by $4/(m\pi)$ and then added to the other quadrature harmonics to obtain $X_{q,sca}$, the input to the quadrature arm accumulator. Thus,

$$X_{q,sca} = -\sqrt{P_d}\,d\sin(\phi_c)\,\frac{8}{\pi^2}\sum_{\substack{m=1 \\ m:\,\text{odd}}}^{L}\frac{\cos(m\phi_{su})}{m^2} + n_{X_{q,sca}} \tag{A-40}$$

where

$$n_{X_{q,sca}} = \frac{4}{\pi}\sum_{\substack{m=1 \\ m:\,\text{odd}}}^{L}\frac{n_C^{(m)}\cos(m\hat{\Theta}_{su}) + n_D^{(m)}\sin(m\hat{\Theta}_{su})}{m}$$

Using the same procedure as in the derivation of the PSD level of $n_{X_i}$, it can be easily verified that $n_{X_{q,sca}}$ has a PSD level approximately equal to $N_0$ W/Hz within the frequency interval $[0, B_{su}]$ Hz. In referring to Fig. 4, it is seen that the suppressed carrier loop quadrature arm accumulator output, $Y_{q,sca}$, is obtained by averaging samples of $X_{q,sca}$ over a symbol duration. Assuming that $\phi_c$ and $m\phi_{su}$, for all $m$, are approximately constant over a symbol duration, the signal $Y_{q,sca}$ is as given in Eq. (44). Furthermore, assuming that $B_{su}$ is much greater than the symbol rate ($1/T$), the variance of $n_{Y_{q,sca}}$ can be shown to be equal to $N_0/(2T)$.

By using Eq. (A-36), it is straightforward to show that $n_{X_i}$ and $n_{X_{q,sca}}$ are independent, which in turn assures that $n_{Y_i}$ and $n_{Y_{q,sca}}$ are independent.
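
The limiting argument used in Section III for the variance of the inphase noise can be illustrated numerically. The sketch below assumes, as stated there, that each per-harmonic noise term has variance $(N_0/2)B_{su}$ and that the terms are mutually independent, so the reference phase drops out through $\cos^2 + \sin^2 = 1$. The function names are hypothetical.

```python
import math


def odd_inverse_square_sum(highest_harmonic):
    """Sum of 1/m^2 over odd m from 1 up to highest_harmonic."""
    return sum(1.0 / (m * m) for m in range(1, highest_harmonic + 1, 2))


def inphase_noise_variance(n0, b_su, highest_harmonic):
    """Variance of the weighted inphase noise sum for a finite L."""
    per_harmonic_variance = 0.5 * n0 * b_su  # variance of each per-harmonic term
    weight = (4.0 / math.pi) ** 2
    return weight * per_harmonic_variance * odd_inverse_square_sum(highest_harmonic)


for highest in (1, 3, 9, 99, 9999):
    ratio = inphase_noise_variance(1.0, 1.0, highest) / 0.5
    print(f"L = {highest:5d}: variance is {ratio:.4f} times the per-harmonic variance")
```

The printed ratio rises toward 2 as $L$ grows, consistent with the one-sided PSD level of $N_0$ W/Hz quoted in the text.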
Table 1. Loss in data power as a function of subcarrier harmonics.

Highest recorded          Number of subcarrier
subcarrier harmonic,      harmonics at A/D
L                         output, n                 Loss, dB
------------------------------------------------------------
1                         1                         0.91
3                         2                         0.45
5                         3                         0.30
7                         4                         0.22
9                         5                         0.18
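
The loss values in Table 1 are consistent with the fraction of the total power of a unit square wave that is carried by its odd harmonics up to order $L$, namely $(8/\pi^2)\sum 1/m^2$ over odd $m \le L$. The sketch below evaluates that expression; it is offered as a cross-check under this assumption and is not taken from the article, and the function name `data_power_loss_db` is hypothetical.

```python
import math


def data_power_loss_db(highest_harmonic):
    """Loss in dB when only odd harmonics up to highest_harmonic are kept.

    Assumption: the retained power fraction of a unit square wave is
    (8 / pi^2) * sum(1 / m^2) over the odd harmonics that are kept.
    """
    retained = (8.0 / math.pi ** 2) * sum(
        1.0 / (m * m) for m in range(1, highest_harmonic + 1, 2)
    )
    return -10.0 * math.log10(retained)


for highest in (1, 3, 5, 7, 9):
    print(f"L = {highest}: loss = {data_power_loss_db(highest):.2f} dB")
```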


Table 2(a). Required number of operations for the demodulation function (n is the
number of harmonics recorded).

                                        Multiplication  Multiplication  Addition    Addition
                                        operations      operations      operations  operations  Lookup
Mode                                    per sample      per symbol      per sample  per symbol  table
------------------------------------------------------------------------------------------------------
Residual carrier with subcarrier        6 + 8n          14              13 + 6n     24          4 + 2n
Suppressed carrier with subcarrier      4 + 14n         14              12 + 10n    25          4 + 2n
Suppressed carrier with no subcarrier   6               12              10          17          4
Table 2(b). Detailed list of operations.

(Mult. = number of multiplication operations; Add. = number of addition operations.)

Three modes with types of                                 Mult.     Mult.     Add.      Add.
operations for each                                       per       per       per       per       Lookup
                                                          sample    symbol    sample    symbol    table
--------------------------------------------------------------------------------------------------------
Residual carrier with subcarrier
Residual carrier phase detector                           2         0         1         0         0
Residual carrier accumulator                              0         0         1         0         0
Residual carrier loop filter                              0         3         0         5         0
Residual carrier numerically controlled oscillator        0         1         1         1         2
Subcarrier phase detector                                 8n        0         2 + 6n    0         0
Subcarrier accumulator                                    0         0         0         1         0
Subcarrier loop filter                                    0         3         0         5         0
Subcarrier numerically controlled oscillator              0         1         1         1         2n
Symbol synchronization phase detector                     0         1         1         1         0
Symbol synchronization accumulator                        0         0         0         1         0
Symbol synchronization loop filter                        0         3         0         5         0
Symbol synchronization numerically controlled oscillator  0         1         1         1         2
Residual carrier lock detector                            2         0         2         0         0
Subcarrier lock detector                                  2         0         2         0         0
Split-symbol SSNR estimator                               0         1         1         3         0
Total                                                     6 + 8n    14        13 + 6n   24        4 + 2n

Suppressed carrier with subcarrier
Carrier/subcarrier phase detector                         14n       0         3 + 10n   0         0
Carrier/subcarrier accumulator                            0         0         0         2         0
Carrier/subcarrier loop filter                            0         6         0         10        0
Carrier/subcarrier numerically controlled oscillator      0         2         2         2         2 + 2n
Symbol synchronization phase detector                     0         1         1         1         0
Symbol synchronization accumulator                        0         0         0         1         0
Symbol synchronization loop filter                        0         3         0         5         0
Symbol synchronization numerically controlled oscillator  0         1         1         1         2
Suppressed carrier lock detector                          2         0         2         0         0
Subcarrier lock detector                                  2         0         2         0         0
Split-symbol SSNR estimator                               0         1         1         3         0
Total                                                     4 + 14n   14        12 + 10n  25        4 + 2n

Suppressed carrier with no subcarrier
Suppressed carrier phase detector                         4         1         4         0         0
Suppressed carrier accumulator                            0         1         0         0         0
Suppressed carrier loop filter                            0         3         0         5         0
Suppressed carrier numerically controlled oscillator      0         1         1         1         2
Symbol synchronization phase detector                     0         1         1         1         0
Symbol synchronization accumulator                        0         0         0         1         0
Symbol synchronization loop filter                        0         3         0         5         0
Symbol synchronization numerically controlled oscillator  0         1         1         1         2
Suppressed carrier lock detector                          2         0         2         0         0
Split-symbol SSNR estimator                               0         1         1         3         0
Total                                                     6         12        10        17        4
Table 3. Throughput comparison (n is the number of harmonics recorded).

                                  Percent throughput   Percent throughput
System               SPECfp92     utilized (n = 3)     utilized (n = 5)
-------------------------------------------------------------------------
Sun SS-2             22.8         100                  151
Sun SS-10/30         52.9         43                   65
Sun SS-10/41         64.7         35                   53
Sun SS-10/52         71.4/CPU     32 (on one CPU)      48 (on one CPU)
SGI Crimson          63.4         36                   54
HP 710               47.6         48                   72
HP 750               75.0         30                   45
IBM 340              51.9         44                   66
IBM 560              85.6         27                   41
Intel Xpress (486)   14.0         160                  242
Symbol Signal-to-Noise Ratio Loss in Square-Wave 
Subcarrier Downconversion 

Y. Feria and J. Statman 
Communications Systems Research Section 


This article presents the simulated results of the signal-to-noise ratio (SNR) loss
in the process of a square-wave subcarrier downconversion. In a previous article
[2], the SNR degradation was evaluated at the output of the downconverter based
on the signal and noise power change. Unlike in the previous article, the SNR loss
is defined here as the difference between the actual and theoretical symbol SNR’s
for the same symbol-error rate at the output of the symbol matched filter. The
results show that an average SNR loss of 0.3 dB can be achieved with tenth-order
infinite impulse response (IIR) filters. This loss is a 0.2-dB increase over the SNR
degradation in the previous analysis where neither the signal distortion nor the
symbol detector was considered.


I. Introduction 

A signal with a downconverted square-wave subcarrier 
suffers distortion caused by the nonideal filtering, the use 
of a finite number of harmonics, the data bandwidth cut- 
off, and so on. This distortion results in a loss of signal- 
to-noise ratio (SNR). In a previous article [2], the SNR 
degradation at the output of the downconverter was mea- 
sured under the following definition: 

$$\text{SNR degradation, dB} = SNR_{ideal} - SNR_{real}$$

where $SNR_{ideal}$ and $SNR_{real}$ are the SNR’s after the
square-wave subcarrier downconversion using ideal and 
realizable filters, respectively. The analysis based on 
this definition, however, does not include the qualitative 
changes, such as distortion, that may have an impact on 
the symbol-error rate (see Fig. 1), nor does it include the 
effect of the symbol detector on the symbol SNR loss. 


For a complete study, the downconverter output is fur- 
ther decimated and fed into a symbol detector, and the 
symbol-error rate is then determined and compared with 
the theoretical symbol-error probability for binary phase- 
shift keying (BPSK) without downconversion [3]. Unlike 
the SNR degradation defined previously [2], the SNR loss 
now is defined as the difference between the required SNR 
and the theoretical SNR for the same symbol-error rate. 

II. Simulation Procedure 

The simulation procedure is illustrated in Fig. 2, where 
the subcarrier down-mixer has been described in [1,2]. The 
simulation assumes symbol synchronization, zero subcar- 
rier phase, and known subcarrier frequency; hence, it fo- 
cuses only on losses due to the nonideal filtering, the im- 
perfect delay compensation, and the data bandwidth cutoff 
in the down-mixing and symbol-detecting processes. The 
simulation employs IIR filters with the bandwidths of 2, 4, and 6 kHz. The symbol-
error rate is compared with both the theoretical symbol- 
error probability for BPSK and the simulated results from 
the direct symbol detection without the downconversion. 

The simulation procedure is as follows: a square-wave
subcarrier at a frequency of 22.5 kHz is modulated by a
pseudo-random sequence with clock time of 1/1000 sec.
A white Gaussian noise is added and its variance is adjusted
so that the symbol SNR, $E_s/N_0$, takes values from
-10 to 10 dB. The relationship between the random noise
variance, $\sigma^2$, and the symbol SNR, $E_s/N_0$, is

$$\sigma^2 = \frac{P_s f_s}{2 f_d (E_s/N_0)}$$

where $P_s$ is the subcarrier power, $f_s$ is the sample frequency,
and $1/f_d$ is the pseudo-noise (PN) sequence clock
time. The noise-contaminated signal is then passed
through a bank of bandpass filters (BPFs) with center
frequencies at the first, third, and fifth harmonics of the
square-wave subcarrier. Each of the BPF outputs is multiplied
by a proper down-mixing signal, and then the products
are added. The sum of the products is then passed
through a lowpass filter (LPF). The subcarrier now has
a lower frequency. The total signal bandwidth is reduced
from 5 times the original subcarrier frequency plus a single-sided
bandwidth of the data to 5 times the new subcarrier
frequency plus a single-sided bandwidth of the data.
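
The noise-variance relationship above can be sketched as a small helper. This is a minimal illustration, assuming $E_s/N_0$ is supplied in decibels and converted to a linear ratio before use; the function name `noise_variance` and the example values of subcarrier power are assumptions, while the 264-kHz sample rate and 1-kHz symbol clock are taken from the simulation conditions.

```python
def noise_variance(p_s, f_s_hz, f_d_hz, es_n0_db):
    """Noise variance that yields a requested symbol SNR.

    p_s      -- subcarrier power (unit power assumed in the example)
    f_s_hz   -- sample frequency in Hz
    f_d_hz   -- PN sequence clock frequency in Hz (1 / clock time)
    es_n0_db -- desired symbol SNR, Es/N0, in dB
    """
    es_n0_linear = 10.0 ** (es_n0_db / 10.0)
    return p_s * f_s_hz / (2.0 * f_d_hz * es_n0_linear)


# Sweep a few of the symbol SNR values used in the simulation (-10 to 10 dB).
for snr_db in (-10.0, 0.0, 10.0):
    variance = noise_variance(1.0, 264e3, 1e3, snr_db)
    print(f"Es/N0 = {snr_db:+5.1f} dB -> noise variance = {variance:.1f}")
```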

After the down-mixing, the original high sample rate 
is no longer necessary; hence, the signal is decimated by 
an integer number equal to $\lfloor f_s/(6 f_B) \rfloor$, where $f_s$ is the
original sample rate and $f_B$ is the BPF bandwidth. The
decimated samples are then multiplied by the sum of the 
first three harmonics at the downconverted subcarrier fre- 
quency and fed into an integrate-and-dump filter for the 
symbol detection. The symbol-error rate is obtained by 
taking the ratio of the number of the incorrect symbol 
detections and the total number of symbols. 
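
The decimation factor described above can be sketched as follows, assuming it is the floor of the original sample rate divided by six times the BPF bandwidth. The function name `decimation_factor` is hypothetical; the sample rates and bandwidths are those listed in Table 1 (simulation conditions).

```python
def decimation_factor(sample_rate_hz, bpf_bandwidth_hz):
    """Integer decimation factor floor(f_s / (6 * f_B))."""
    return sample_rate_hz // (6 * bpf_bandwidth_hz)


# (original sample rate in Hz, BPF bandwidth in Hz) for the three cases.
cases = [(264_000, 2_000), (264_000, 4_000), (288_000, 6_000)]

for f_s, f_b in cases:
    factor = decimation_factor(f_s, f_b)
    reduced_rate = f_s // factor
    print(f"BPF {f_b // 1000} kHz: decimate by {factor}, reduced rate {reduced_rate // 1000} kHz")
```

With these inputs the reduced rates come out as 12, 24, and 36 kHz, which match the reduced sample rates listed in the simulation conditions.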

At the downconverter output, a delay due to the non- 
ideal filtering is compensated for. The compensation is 
done in terms of sample periods before the decimation for 
the higher resolution. Note that this compensation, how- 
ever, may be off by a fraction of a sample period since the 
delay period may not be an integer multiple of the sample 
period. The SNR degradation due to the symbol synchro- 
nization offset has been previously studied [4], and is thus 
not discussed in this article. The symbol SNR loss from 
the downconversion is then determined by taking the dif- 
ference between the actual and the ideal E s /N 0 in decibels 
required to achieve the same symbol-error rate. 


Three different BPF bandwidths are used in the simulation.
The sample rates before and after the downcon- 
version, the total number of samples, and the number of 
symbols used in each case are summarized in Table 1. 

III. Results 

For the BPF bandwidths of 2, 4, and 6 kHz, the 
symbol-error rates are obtained through the simulations 
with the symbol SNR varied from -10 to 10 dB. The 
comparisons between the simulated results and the theor- 
etical symbol-error probabilities for BPSK are shown in 
Fig. 3. The theoretical symbol-error probability for BPSK 
is computed with the complementary error function [3],
$\mathrm{erfc}(\sqrt{E_s/N_0})$, with an accuracy of $10^{-5}$. Therefore, the
measured symbol-error rates for a symbol SNR greater
than 6 dB may not be very accurate. Another problem
is the limitation of the number of symbols used in the simulations.
In this case, 250,000 symbols are used. For a
symbol-error rate on the order of $10^{-5}$, it may be necessary
to simulate $10^6$ symbols, which is beyond the available
time and resource constraints.

To obtain the symbol SNR loss for a given symbol-error 
rate, first the square of the inverse of the complementary 
error function of two times the symbol-error rate is evaluated
with an accuracy of $10^{-10}$. The obtained actual SNR
is then compared with the theoretical SNR. The difference
of the two SNR’s, $\Delta SNR$, is the SNR loss. The SNR loss
is computed for each simulation point, and the results are 
shown in Fig. 4. The points that are beyond 6 dB are not 
accurate, as mentioned earlier; therefore, they may not be 
shown in the figure. 
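
The inversion step described above can be sketched with the Python standard library. This is an illustration under stated assumptions: the BPSK symbol-error probability is taken as $\tfrac{1}{2}\mathrm{erfc}(\sqrt{E_s/N_0})$, the inverse complementary error function is obtained from the standard normal quantile via $\mathrm{erfc}^{-1}(2p) = -\Phi^{-1}(p)/\sqrt{2}$, and the loss is taken as the applied symbol SNR minus the SNR implied by the measured error rate. All function names are hypothetical.

```python
import math
from statistics import NormalDist


def bpsk_symbol_error_probability(es_n0_db):
    """Theoretical BPSK symbol-error probability, 0.5 * erfc(sqrt(Es/N0))."""
    es_n0 = 10.0 ** (es_n0_db / 10.0)
    return 0.5 * math.erfc(math.sqrt(es_n0))


def equivalent_snr_db(symbol_error_rate):
    """Symbol SNR (dB) that ideal BPSK would need for this error rate.

    Valid for 0 < symbol_error_rate < 0.5.
    """
    inverse_erfc = -NormalDist().inv_cdf(symbol_error_rate) / math.sqrt(2.0)
    return 10.0 * math.log10(inverse_erfc ** 2)


def snr_loss_db(applied_es_n0_db, measured_symbol_error_rate):
    """Applied symbol SNR minus the SNR implied by the measured error rate."""
    return applied_es_n0_db - equivalent_snr_db(measured_symbol_error_rate)


# A detector that behaves as if its input SNR were 0.3 dB lower than applied
# (the 0.3-dB figure is used here only as an illustrative input).
applied_db = 4.0
measured = bpsk_symbol_error_probability(applied_db - 0.3)
print(f"measured symbol-error rate: {measured:.5f}")
print(f"SNR loss: {snr_loss_db(applied_db, measured):.3f} dB")
```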

For each BPF bandwidth, the SNR losses are averaged, 
and the results are given in Table 2, where DC stands for 
downconversion, and SD stands for symbol detection. The 
SNR degradations in the downconversion process without 
symbol detection in Table 2 were obtained in [2]. The SNR 
loss in the process of symbol detection without downcon- 
version is also obtained and shown in Table 2. Further- 
more, the relationship between the SNR loss and the BPF 
bandwidth is shown in Fig. 5. 

For the BPF bandwidth varied from 2 to 6 kHz, the av- 
erage symbol SNR loss decreased from 0.8745 to 0.2850 dB 
in the simulations. The loss is higher than the 0.5- to 
0.1-dB SNR degradation computed in the previous anal- 
ysis [2] without the symbol detection. However, the de- 
crease in SNR loss with wider filter bandwidths agrees 
with the SNR degradation obtained in the previous anal- 
ysis. The SNR loss includes the effect of the cutoff of the 
side lobes of the PN data signal spectrum, the effect of the 
nonideal filtering, and the imperfect delay compensation. 

For example, when the BPF bandwidth is 6 kHz, the 
average symbol SNR loss is 0.2850 dB. This loss is higher 
than the SNR degradation in the previous analysis [2], 
which is about 0.1 dB. The loss is due to several factors. 
First, the nonideal filtering causes signal distortion. Sec- 
ond, when the group delay of the output signal at the 
downconverter is not an integer multiple of the sample 
period, there is an SNR loss due to symbol timing mis- 
alignment. Third, when the decimated signal is multiplied 
by the sum of the harmonics, the high-frequency terms of 
the product can cause aliasing if the original signal is sam- 
pled at the Nyquist rate. Last, the integrate-and-dump 
filter acts as a lowpass filter and its performance is not 
as good as that of an ideal LPF. As the bandpass filter 
bandwidth gets narrower, the loss becomes larger. This is 
partly due to the cutoff of the PN data signal bandwidth 
[2], and also when the filter bandwidth needs to be nar- 
rower, the required filter order may be higher for a better 
performance. 

Several attempts were unsuccessful in reducing the SNR
loss. First, since the integrate-and-dump filter does not
perform as an ideal LPF, an LPF is put in front of the
integrate-and-dump filter. However, the symbol-error rate
increases. The reason is that the transition region is enlarged
by lowpass filtering.

It is also found that in the symbol detection process, 
if the incoming signal is multiplied by a square wave in- 
stead of by the first three harmonics of the square wave, 
the symbol SNR loss increases. The reason is that when a 
sine wave is multiplied by a square wave, assuming perfect 
timing and frequency alignment, the resulting rectified sine 
wave contains a constant term plus an infinite series of har- 
monics in its Fourier series expansion, which is not band 
limited. After sampling at a finite rate, aliasing occurs. 
If, on the other hand, only the first three harmonics are 
multiplied by the incoming signal, aliasing will not occur 
given a proper sample rate. 

A higher sample rate of 324 kHz was also employed at 
the input of the downconverter for a 6-kHz BPF band- 
width, but the SNR loss did not reduce significantly. 

IV. Conclusion 

This article presents the symbol SNR loss due to the 
process of the square-wave subcarrier downconversion. 
The symbol SNR loss is measured as the difference be- 
tween the actual symbol SNR and the theoretical SNR for 
the same symbol-error rate. An average loss of 0.3 dB was 
achieved with tenth-order IIR filters. 
Table 1. Simulation conditions.

BPF bandwidth, kHz           2             4             6
----------------------------------------------------------------
Original sample rate, kHz    264           264           288
Reduced sample rate, kHz     12            24            36
Samples per symbol           12            24            36
Number of samples            6.6 × 10^6    6.6 × 10^6    7.2 × 10^6
Number of symbols            25 × 10^3     25 × 10^3     25 × 10^3


Table 2. SNR loss and degradation, dB.

BPF bandwidth, kHz              2         4         6
-------------------------------------------------------
Loss in DC with SD              0.8745    0.4218    0.2850
Degradation in DC without SD    0.3964    0.1655    0.0606
Loss in SD without DC           0.0288    0.0288    0.0288
Design of the Reduced LQG Compensator 
for the DSS-13 Antenna 

W. Gawronski 

Ground Antennas and Facilities Engineering Section 


A linear-quadratic-Gaussian (LQG) compensator design procedure is proposed
for the DSS-13 antenna. The procedure is based on two properties. It is shown that
tracking and flexible motion of the antenna are almost independent (the separation
property). As a consequence, compensators for the flexible and tracking parts can
be designed separately. It is shown also that the balanced LQG compensator's effort
is evenly divided between the controller and the estimator. This allows a minimization
of the compensator order, which is important for implementation purposes.
An efficient compensator reduction procedure that gives a stable low-order com- 
pensator of satisfactory performance is introduced. This approach is illustrated 
with a detailed compensator design for the DSS-13 antenna. The implementation 
of this compensator design requires an update of the antenna model. 


I. Introduction 

The linear quadratic controller for the DSS-14 antenna 
was designed by Alvarez and Nickerson [1], and a linear 
quadratic controller for the DSS-13 antenna was designed 
by Gawronski [2]. The design method presented in this 
article extends the results obtained in [2] for the case when 
full-state feedback is not available. 

The development of new high-performance controllers 
for the DSN antennas is a current priority. The existing 
proportional and integral (PI) controllers satisfy the re- 
quirements for X-band (8.4-GHz) tracking; they remain 
simple, robust to parameter variations, and do not re- 
quire detailed knowledge of the antenna dynamics. How- 
ever, due to the recent pointing requirements for Ka-band 
(32 GHz), new performance requirements for the antenna 
controllers have emerged. The PI controllers have reached 
their performance limits, therefore a new generation of 
controllers has to be designed and developed. Also, in 


order to improve controller performance, more sophisti- 
cated and accurate antenna models have to be developed. 
As a rule, the better the knowledge of the plant dynam- 
ics, the better the performance that can be achieved by 
the controller. The recently developed antenna models [3] 
are accurate enough to give an opportunity to improve 
tracking performance. The models allow simulation of si- 
multaneous tracking in azimuth and elevation, and include 
antenna flexible deformations up to 10 Hz. 

Among the family of the model-based controllers, the 
linear-quadratic-Gaussian (LQG) compensator has been 
chosen for the DSN antennas because it is commonly 
known to be performing well in industrial applications. 
The LQG compensator consists of a controller and an esti- 
mator. The controller drives the antenna, and the driving 
control torques are determined from the knowledge of the 
full antenna dynamics. Since only a small part of the an- 
tenna dynamics is measured (by encoders), the estimator 
is implemented to reconstruct the “missing” dynamics. 
The controller and the estimator designs consist of adjusting
their gains through proper determination of the
controller and estimator weights. For the antenna model
of order $n$, $2n^2$ weights have to be determined ($n^2$ for the
controller and $n^2$ for the estimator). This number is customarily
dropped to $2n$ weights ($n$ for the controller and
$n$ for the estimator). But in spite of this drastic reduction,
the number of weights is still too large to make the
search for the best weights reasonable (typically $n = 40$
for the antenna, i.e., 80 weights have to be found). The
difficulty arises because no general procedure for weight
determination is available, and the known procedures deal
with simplified and/or specific cases.

The weight determination presented here becomes sim- 
ple due to several properties of the antenna and the com- 
pensator investigated in this article. First, it is shown that 
the tracking and flexible motion of the antenna are almost 
separated. The tracking part consists of four states (elevation
and azimuth encoder readings and their integrals);
thus, instead of dealing with a model of order $n$, one obtains
two separate models of order 4 and $n - 4$. Secondly,
the LQG compensator is balanced such that the controller
and estimator efforts are the same. For the balanced
compensator, the weights of the controller and estimator
are the same; thus, instead of $2n$ weights, $n$ weights need to
be determined. Thirdly, it is shown that each component
of the balanced compensator (consisting of two states) is 
almost independent of others. Thus, weights for each com- 
ponent are determined separately. In consequence, the 
search for 2n weights becomes a series of searches for 2 
weights, which obviously is not a difficult task to perform. 
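
The bookkeeping behind this reduction can be checked with a few lines of arithmetic. The sketch below is illustrative only: the function name `weight_counts` is hypothetical, the four tracking states follow the description above, and $n = 40$ is the typical model order quoted earlier rather than the order of any specific antenna model.

```python
def weight_counts(n, n_track=4):
    """Count the LQG weights to be tuned for a plant model of order n.

    n_track is the number of tracking states (four in the text); the
    remaining n - n_track states form two-state flexible components.
    """
    if n < n_track or (n - n_track) % 2:
        raise ValueError("n - n_track must be a nonnegative even number")
    return {
        "full_matrices": 2 * n * n,   # n^2 controller + n^2 estimator entries
        "diagonal": 2 * n,            # n controller + n estimator weights
        "balanced": n,                # controller and estimator share weights
        "flexible_components": (n - n_track) // 2,  # tuned one at a time
    }


counts = weight_counts(40)
```

For $n = 40$ this gives 3200 entries for full weight matrices, 80 diagonal weights, 40 weights after balancing, and 18 flexible components that can be handled one by one.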

As mentioned, the PI controller is easy to implement 
due to its simplicity. But the implementation of the 40- 
state model-based LQG compensator is not an easy task. 
It would result in a complex algorithm, and would be a 
huge computational burden. Therefore, a simplification of 
the LQG compensator is an important implementation re- 
quirement. The simplification is obtained through order 
reduction of the compensator. The size of the compen- 
sator is reduced, but one has to find a reduction procedure 
such that the reduced-order compensator is stable and its 
performance is still close to the full-order compensator. 
This task is solved by introducing the pole mobility index. 
The pole mobility index characterizes the importance of 
the components of the balanced compensator. The states 
with small pole mobility index are truncated, and the trun- 
cation marginally affects the closed-loop dynamics. The 
closed-loop system with the reduced compensator is sta- 
ble, and its performance is close to the full-order compen- 
sator. It is shown that the reduced-order compensator of 
12 states is stable and its performance is close to the per- 
formance of the full-order compensator of 40 states. 


II. Problem Statement 

The closed-loop system with an LQG compensator is
shown in Fig. 1, with the plant state-space triple $(A, B, C)$,
the process noise $v$ of intensity $V$, and the measurement
noise $w$ of intensity $W$, where $v$ and $w$ are uncorrelated:

$V = E(vv^T), \quad W = E(ww^T), \quad E(vw^T) = 0, \quad E(v) = 0, \quad E(w) = 0$    (1)


where $E(\cdot)$ is the expectation operator. It is assumed that
$W = I$ without loss of generality. The task is to determine
the controller gain ($K_p$) and estimator gain ($K_e$) such that
the performance index $J$

$J = E\left(\int_0^\infty (x^T Q x + u^T R u)\,dt\right)$    (2)

is minimal, where $R$ is a positive definite input weight matrix,
and $Q$ is a positive semidefinite state weight matrix.
It is assumed that $R = I$, also without loss of generality.
The minimum of $J$ is obtained for the feedback $u = -K_p x$,
where the gain matrix

$K_p = B^T S$    (3)

is obtained from the solution $S$ of the controller Riccati
equation (CARE) [4,5]:

$A^T S + SA - SBB^T S + Q = 0$    (4)



The optimal estimator gain is given by

$K_e = PC^T$    (5)

where $P$ is the solution of the estimator Riccati equation
(FARE):

$AP + PA^T - PC^TCP + V = 0$    (6)
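
As a minimal sketch of Eqs. (3) through (6), consider a scalar plant, for which both Riccati equations reduce to quadratics with a closed-form positive root. The plant values `a`, `b`, `c` and the weights `q`, `v` below are hypothetical and are not taken from the antenna model; $R = I$ and $W = I$ are assumed as in the text.

```python
import math


def scalar_care(a, b, q):
    """Positive root s of 2*a*s - (b*s)**2 + q = 0, i.e., scalar Eq. (4)."""
    return (a + math.sqrt(a * a + b * b * q)) / (b * b)


def scalar_lqg_gains(a, b, c, q, v):
    """Controller and estimator gains for a scalar plant (a, b, c)."""
    s = scalar_care(a, b, q)   # CARE, Eq. (4)
    p = scalar_care(a, c, v)   # FARE, Eq. (6): same quadratic with b -> c, q -> v
    return b * s, p * c        # K_p = B^T S (Eq. 3), K_e = P C^T (Eq. 5)


# Hypothetical stable scalar plant and weights.
a, b, c = -0.5, 2.0, 1.0
k_p, k_e = scalar_lqg_gains(a, b, c, q=3.0, v=2.0)
controller_pole = a - b * k_p   # closed-loop controller pole
estimator_pole = a - k_e * c    # estimator error pole
```

Both poles move to the left of the open-loop pole `a`, which is the scalar counterpart of the pole shift discussed later for the flexible modes.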


The LQG compensator performance can be significantly
modified through variations of the weight $Q$ and the
variance $V$. Although $V$ is formally predetermined by the
process noise $v$, it can be modified in a search for a more
suitable solution [5,6]. The determination of the weight
and covariance is addressed in the following section.

Another nontrivial issue addressed here is the order of
the compensator. Although the size of the plant determines
the size of the compensator, in many cases a full-size
compensator is not acceptable for implementation due to
its complexity. Thus, its order must be minimized in such
a way that the reduced compensator maintains the stability
and performance of the full-order compensator. The
solution to this problem is found through the approximate
balancing of the CARE and FARE.


III. Quasi-Separation of Flexible and 
Tracking Subsystems 

The open-loop (or rate-loop) state-space representation
$(A, B, C)$ of the DSS-13 antenna includes the input
$u^T = [u_{pe} \; u_{pa}]$, which consists of the rate commands
in elevation ($u_{pe}$) and azimuth ($u_{pa}$), and the outputs
$y_p^T = [y_{pe} \; y_{pa}]$ and $y_i^T = [y_{ie} \; y_{ia}]$, which consist of the
elevation and azimuth angles ($y_{pe}$, $y_{pa}$) and their integrals
($y_{ie}$, $y_{ia}$). Divide the state vector $x$ of the open-loop
antenna model into the tracking ($x_t$) and flexible ($x_f$) parts

$x^T = [x_t^T \; x_f^T]$    (7)

where $x_t^T = [y_i^T \; y_p^T]$ and $x_f$ contains the remaining states. It
can be shown [2] that, in this case, the rate-loop representation
$(A, B, C)$ has the form

$A = \begin{bmatrix} A_t & A_{tf} \\ 0 & A_f \end{bmatrix}, \quad B = \begin{bmatrix} B_t \\ B_f \end{bmatrix}, \quad C = [C_t \; 0]$    (8)

and that

$\|B_t\| \ll \|B_f\|, \quad \|A_t\| \ll \|A_f\|, \quad \|A_{tf}\| \ll \|A_f\|$    (9)

For the DSS-13 antenna, $\|B_t\| < 10^{-5}$, $\|B_f\| > 1$,
$\|A_{tf}\| < 10^{-3}$, $\|A_f\| > 10$, and $\|A_t\| = 1$. Thus, the
states of the tracking part are much weaker than the states
of the flexible part. The strong and weak signal flow is
shown in the block diagram of Fig. 2. The strong states of
the flexible subsystem and the weak states of the tracking
subsystem are shown in Fig. 3, which presents the transfer
function plots of the rate-loop systems due to elevation
rate command.

In the LQG design, the performance index is minimized,
and the minimum is obtained for $K = B^T S$, where $S$ is the
solution of the Riccati Eq. (4). Divide $S$ and $K$ into parts
related to the triple $(A, B, C)$ in Eq. (8)

$S = \begin{bmatrix} S_t & S_{tf} \\ S_{ft} & S_f \end{bmatrix}, \quad K = [K_t \; K_f]$    (10)

(with $S_{ft} = S_{tf}^T$) so that Eq. (4) can be written as follows

$A_t^T S_t + S_t A_t - S_t B_t B_t^T S_t + Q_t - \Delta_{tf} = 0$    (11a)

$A_t^T S_{tf} + S_{tf} A_f + S_t A_{tf} - K_t^T K_f = 0$    (11b)

$A_f^T S_f + S_f A_f - S_f B_f B_f^T S_f + Q_f - \Delta_{ft} = 0$    (11c)

where

$\Delta_{ft} = -A_{tf}^T S_{tf} - S_{ft} A_{tf} + S_{ft} B_t K_f + S_f B_f B_t^T S_{tf}$

$\Delta_{tf} = S_t B_t B_f^T S_{ft} + S_{tf} B_f K_t$    (12a)

$K_f = B_t^T S_{tf} + B_f^T S_f, \quad K_t = B_t^T S_t + B_f^T S_{ft}$    (12b)

Taking a closer look at Eqs. (12), notice that there exist
weights $Q_t$ and $Q_f$ such that the gain $K_f$ depends on the
flexible subsystem only. Namely, for large enough $Q_f$, such
that $\|Q_f\| \gg \|\Delta_{ft}\|$, the solution $S_f$ of Eq. (11c) is
independent of the tracking system, and for small $Q_t$, one
obtains $\|B_t^T S_{tf}\| \ll \|B_f^T S_f\|$. In terms of Eq. (12b), the
latter inequality means that the gain $K_f$ depends only on
the flexible subsystem. However, due to the master-slave
relationship between flexible and tracking subsystems, the
situation is not quite symmetric: There are no such $Q_t$
and $Q_f$ that the gain $K_t$ depends only on the tracking
subsystem. To understand this, note that the term “small”
has a different meaning for $Q_f$ and $Q_t$. Magnitudes of
small $Q_f$ and small $Q_t$ are of different order, namely small
$Q_f$ is such that $Q_f < 10^{-7}$, and small $Q_t$ is such that
$Q_t < 1$. Therefore, by increasing $Q_t$ in order to obtain
$\|Q_t\| \gg \|\Delta_{tf}\|$, one obtains $\|B_f^T S_{ft}\|$ and $\|B_t^T S_t\|$ of the
same magnitude. According to Eq. (12b), the latter fact
means that the gain $K_t$ depends on the flexible subsystem,
as well as on the tracking subsystem, and the solution $S_t$
of Eq. (11a) is dependent on the flexible subsystem. This
property can be validated by observation of the closed-loop
transfer functions for different weights (Fig. 4). The
plots show that the variations of $Q_f$ changed the properties
of the flexible subsystem only, while the variations of $Q_t$
changed the properties of both subsystems.

The weight $Q_t$ should be large enough to achieve the
pointing performance requirements, and the increase of $Q_t$
causes increasing dependency of the gains on the tracking
system. For this reason the above independence becomes
a quasi-independence in the final stage of controller
design. Nevertheless, the separation in the initial stages 
of controller design is very strong. The design consists, 
therefore, of the initial choice of relatively small weights 
for the tracking subsystem and determination of the con- 
troller gains of the flexible subsystem. It is followed by 
adjustment of weights of the tracking subsystem and a fi- 
nal tuning of the flexible weights. 

The quasi-separation principle discussed above in the 
case of the controller design is also valid in the case of 
estimator design, since the compensator design consists of 
the independent designs of a controller and an estimator 
[4,5]. Additional properties of the LQG compensator that 
arise in controls of flexible structures are discussed in the 
following section. 


IV. Balanced LQG Compensator for
Flexible Structures

In this section the flexible subsystem is considered only
(subscript “f” is dropped in this section for simplicity of
notation). A flexible structure is defined as a controllable
and observable linear system with distinct complex conjugate
pairs of poles ($N$ poles, $N$ is even) and with small
real parts of the poles. In other words, it is a linear system
with vibrational properties. In the Moore balanced
coordinates, it consists of $n = N/2$ components [7,8], and
each component consists of two states.

An approximately balanced LQG compensator is considered.
An approximate equality between two variables is
used in the following sense: Two variables $x$ and $y$ are
approximately equal ($x \cong y$) if $x = y + \epsilon$ and $\|\epsilon\|/\|y\| \ll 1$.

The block diagram of a closed-loop flexible system with
the LQG compensator is shown in Fig. 5. Similar to the
balancing of controllability and observability grammians
is the balancing of CARE and FARE equations. Namely,
there exists a diagonal positive definite $M = \mathrm{diag}(\mu_i)$,
$\mu_i > 0$, $i = 1, \ldots, n$, such that

$S = P = M$    (13)

A state-space representation with the condition
Eq. (13) satisfied is called an LQG balanced representation,
and $\mu_i$, $i = 1, \ldots, n$, represent the characteristic values
of $(A, B, C)$. Jonckheere and Silverman [9] and Opdenacker
and Jonckheere [10] have shown that a balanced
solution for CARE and FARE equations exists in the case
of $Q = C^TC$ and $V = BB^T$. Gawronski [11] has shown
that the balanced LQG representation exists in the case of
general $Q$ and $V$, and has derived the transformation to
the balanced LQG representation.

Let $(A, B, C)$ be a state-space triple of an open-loop
system. Its controllability and observability grammians
$W_c$ and $W_o$ are positive-definite and satisfy the Lyapunov
equations

$AW_c + W_cA^T + BB^T = 0, \quad A^TW_o + W_oA + C^TC = 0$    (14)

The system representation is balanced in the sense of
Moore [12] if its controllability and observability grammians
are diagonal and equal:

$W_c = W_o = \Gamma^2, \quad \Gamma = \mathrm{diag}(\gamma_1, \ldots, \gamma_N), \quad i = 1, \ldots, N$    (15)

where $\gamma_i > 0$ is the $i$th Hankel singular value of the system.
In [11] it is shown that, for flexible structures, the balanced
representation (in the Moore sense) produces diagonally
dominant solutions of CARE and FARE, and in the case
of $Q = V$ produces approximate LQG balanced solutions
$S$ and $P$, such that $S \cong P \cong M$. Assume a diagonal
weight matrix $Q$:
$Q = \mathrm{diag}(q_i I_2), \quad i = 1, \ldots, n$    (16)

Then there exists $q_i \le q_{oi}$, where $q_{oi} > 0$, $i = 1, \ldots, n$, such
that $S \cong \mathrm{diag}(s_i I_2)$ is the solution of Eq. (4), where

$s_i = \frac{\beta_{pi} - 1}{2\gamma_i^2}, \quad \beta_{pi} = \sqrt{1 + \frac{2 q_i \gamma_i^2}{\zeta_i \omega_i}}$    (17)

The proof is presented in the Appendix. The plots of $s_i$
with respect to $q_i$ and $\gamma_i$ are shown in Fig. 6. They show an
increase of $s_i$ with the increase of the weight $q_i$, and a decrease
of $s_i$ with the increase of $\gamma_i$ or $\zeta_i\omega_i$.
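
The closed-form entry in Eq. (17) can be checked numerically against the per-mode Riccati equation. The sketch below assumes the balanced flexible-mode properties $A_i + A_i^T = -2\zeta_i\omega_i I_2$ and $B_iB_i^T \cong 2\gamma_i^2\zeta_i\omega_i I_2$ used in the Appendix; the function names and the mode parameters (damping, Hankel singular value) are hypothetical and chosen only for illustration.

```python
import math


def pole_shift_factor(q, gamma, zeta, omega):
    """beta_pi of Eq. (17) for one two-state flexible component."""
    return math.sqrt(1.0 + 2.0 * q * gamma**2 / (zeta * omega))


def riccati_entry(q, gamma, zeta, omega):
    """s_i of Eq. (17): the diagonal entry of the CARE solution."""
    return (pole_shift_factor(q, gamma, zeta, omega) - 1.0) / (2.0 * gamma**2)


def care_residual(s, q, gamma, zeta, omega):
    """Scalar form of s_i (A_i + A_i^T) - s_i^2 B_i B_i^T + q_i I_2.

    Uses A_i + A_i^T = -2*zeta*omega*I_2 and
    B_i B_i^T = 2*gamma^2*zeta*omega*I_2 (balanced flexible mode).
    """
    return -2.0 * zeta * omega * s - 2.0 * gamma**2 * zeta * omega * s**2 + q


# Hypothetical mode: 2.32 Hz, 1 percent damping, gamma = 1000.
zeta, omega, gamma, q = 0.01, 2.0 * math.pi * 2.32, 1000.0, 1e-7
s = riccati_entry(q, gamma, zeta, omega)
residual = care_residual(s, q, gamma, zeta, omega)
```

The residual vanishes to rounding error, and `riccati_entry` returns zero for a zero weight, which is the root selected in the Appendix.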

A similar result is obtained for the FARE equation,
namely, for a diagonal $V$:

$V = \mathrm{diag}(v_i I_2), \quad i = 1, \ldots, n$    (18)

there exists $v_i \le v_{oi}$, where $v_{oi} > 0$, $i = 1, \ldots, n$, such that
$P \cong \mathrm{diag}(p_i I_2)$ is the solution of Eq. (6), where

$p_i = \frac{\beta_{ei} - 1}{2\gamma_i^2}, \quad \beta_{ei} = \sqrt{1 + \frac{2 v_i \gamma_i^2}{\zeta_i \omega_i}}$    (19)


If the $i$th diagonal entry of $P$ and the respective entry
of $S$ are equal, say to $\mu_i$, i.e.,

$p_i = s_i = \mu_i$    (20)

the $i$th component is LQG balanced; if $S$ and $P$ are equal,
the system is LQG balanced. If $S$, $P$, and $M$ are diagonally
dominant, i.e., $p_i + \epsilon_{pi} = s_i + \epsilon_{si} = \mu_i$, with $\epsilon_{pi}$
and $\epsilon_{si}$ small ($|\epsilon_{pi}/p_i| \ll 1$, $|\epsilon_{si}/s_i| \ll 1$), the system is
approximately LQG balanced.

It follows from Eqs. (17) and (19) that for $Q =
\mathrm{diag}(q_i) = V = \mathrm{diag}(v_i)$, the system is approximately LQG
balanced, such that

$S \cong P \cong M = \mathrm{diag}(\mu_i), \quad \mu_i = \frac{\beta_i - 1}{2\gamma_i^2}, \quad \beta_i = \sqrt{1 + \frac{2 q_i \gamma_i^2}{\zeta_i \omega_i}}$    (21)

Next it is shown that the weight $Q$

$Q = \mathrm{diag}(0, 0, \ldots, q_i I_2, \ldots, 0, 0), \quad q_i \le q_{oi}$    (22)

shifts the $i$th pair of complex poles of the flexible structure
and leaves the remaining pairs of poles almost unchanged.
Only the real part of the pair of poles is changed (moving
the poles away from the imaginary axis and stabilizing
the system), and the imaginary part of the poles remains
unchanged. Namely, for the weight $Q$ as in Eq. (22), the
closed-loop pair of flexible poles $(\lambda_{cri}, \pm j\lambda_{cii})$ relates to
the open-loop poles $(\lambda_{ori}, \pm j\lambda_{oii})$ as follows:

$(\lambda_{cri}, \pm j\lambda_{cii}) \cong (\beta_{pi}\lambda_{ori}, \pm j\lambda_{oii}), \quad i = 1, \ldots, n$    (23)

where $\beta_{pi}$ is defined in Eq. (17). The proof is presented in
the Appendix.

The real part of the poles is scaled by the factor $\beta_{pi}$, while the
imaginary part remains unchanged. The above proposition
has additional interpretations. Denote the real part of
the open-loop pole by $\lambda_{ori} = -\zeta_i\omega_i$ and the real part of
the closed-loop pole by $\lambda_{cri} = -\zeta_{ci}\omega_i$; note also that the
height of the open-loop resonant peak is $\alpha_{oi} = \kappa/(2\zeta_i\omega_i)$,
where $\kappa$ is a constant, and the closed-loop resonant peak
is $\alpha_{ci} = \kappa/(2\zeta_{ci}\omega_i)$. From Eq. (23), $\beta_{pi} = \lambda_{cri}/\lambda_{ori}$; thus,
one obtains

$\beta_{pi} = \zeta_{ci}/\zeta_i = \alpha_{oi}/\alpha_{ci}$    (24)

i.e., $\beta_{pi}$ is a ratio of closed- and open-loop damping factors,
or it is a ratio of open- and closed-loop resonant peaks.
Therefore, if a suppression of the $i$th resonant peak by the
factor $\beta_{pi}$ is required, the appropriate weight $q_i$ is determined
from Eq. (17):

$q_i = 0.5(\beta_{pi}^2 - 1)\zeta_i\omega_i/\gamma_i^2$    (25)
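
A short sketch of how Eq. (25) might be used in practice: given a required suppression factor for one resonant peak, compute the weight and confirm that Eq. (17) returns the same factor. The function names and the mode parameters are hypothetical; in a real design the damping, frequency, and Hankel singular value would come from the balanced antenna model.

```python
import math


def weight_for_suppression(beta, gamma, zeta, omega):
    """Eq. (25): weight q_i that scales the i-th modal damping by beta."""
    if beta < 1.0:
        raise ValueError("beta must be at least 1 (no negative weights)")
    return 0.5 * (beta**2 - 1.0) * zeta * omega / gamma**2


def suppression_for_weight(q, gamma, zeta, omega):
    """Eq. (17): pole shift (peak suppression) factor produced by q_i."""
    return math.sqrt(1.0 + 2.0 * q * gamma**2 / (zeta * omega))


# Hypothetical mode: 2.32 Hz, 1 percent damping, gamma = 1000.
zeta, omega, gamma = 0.01, 2.0 * math.pi * 2.32, 1000.0
q_needed = weight_for_suppression(4.0, gamma, zeta, omega)  # peak reduced 4x
beta_back = suppression_for_weight(q_needed, gamma, zeta, omega)
```

The round trip holds only while the weight stays below the limiting value $q_{oi}$, beyond which the approximate relations no longer apply.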

An alternative interpretation of $\beta_i$ is as a ratio of the
open-loop Hankel singular value to the closed-loop Hankel
singular value:

$\beta_i = \gamma_{oi}/\gamma_{ci}$    (26a)

or a ratio of variances of open-loop ($\sigma_{oi}^2$) and closed-loop
($\sigma_{ci}^2$) states excited by the white-noise input [13]:

$\beta_i = \sigma_{oi}^2/\sigma_{ci}^2$    (26b)

The proof is presented in the Appendix.

The plots of $\beta_{pi}$ with respect to $q_i$ and $\gamma_i$ are shown in
Fig. 7. They show relatively large $\beta_{pi}$ even for small $q_i$, i.e.,
a significant pole shift to the left. Also, $\beta_{pi}$ increases with
the increase of $\gamma_i$, and decreases with the increase of $\zeta_i\omega_i$,
i.e., there is a significant pole shift for highly observable
and controllable states with small damping. In terms of
the transfer function profile, the weight $q_i$ suppresses the
resonant peak at frequency $\omega_i$ while leaving the natural
frequency unchanged. Due to weak coupling between the
states, the assignment of one pair of states does not significantly
impact other states. Thus, the weight assignment
can be done separately for each pair of states.

The estimator poles are shifted in a similar manner.
Denote

$V = \mathrm{diag}(0, 0, \ldots, v_i I_2, \ldots, 0, 0), \quad v_i \le v_{oi}$    (27)

then for the weight $V$ as in Eq. (27) and $v_i \le v_{oi}$, the
estimator pair of poles $(\lambda_{eri}, \pm j\lambda_{eii})$ relates to the open-loop
poles $(\lambda_{ori}, \pm j\lambda_{oii})$ as follows:

$(\lambda_{eri}, \pm j\lambda_{eii}) \cong (\beta_{ei}\lambda_{ori}, \pm j\lambda_{oii})$    (28)

where $\beta_{ei}$ is defined in Eq. (19).


The limiting values $q_{oi}$ and $v_{oi}$ remain to be determined. They
are not sharply defined numbers; nevertheless,
they are not difficult to determine. There are several indicators
that $q_i$ is approaching $q_{oi}$, or that $v_i$ is approaching
$v_{oi}$. For the controller, $q_{oi}$ is the weight for which the
$i$th pair of complex poles of the plant departs from the
horizontal trajectory in the root-locus plane, or it is the
weight for which the $i$th resonant peak of the plant transfer
function disappears (the peak is flattened). For the
estimator, $v_{oi}$ is the covariance for which the $i$th pair of
complex poles of the estimator departs from the horizontal
trajectory in the root-locus plane, or it is the covariance
for which the $i$th resonant peak of the estimator transfer
function disappears.

It is crucial from an implementation point of view to ob- 
tain a compensator of the smallest possible dimension that 
preserves the stability and performance of the full-order 
compensator. Although the size of a plant determines the 
size of a compensator, the plant model cannot be reduced 
excessively in advance in order to assure the quality of the 
closed-loop system. Therefore, compensator reduction is a 
part of the compensator design. The balanced LQG design 
procedure provides this opportunity. 

To successfully perform the compensator reduction, an 
index of the importance of each compensator component 
is introduced. In the open-loop case, Hankel singular val- 
ues serve as reduction indices. In the closed-loop case, the 
characteristic values of the system seem to be good can- 
didates for the reduction indices, as suggested in [9]. This 
is not a good choice, however, since the characteristic val- 
ues do not properly reflect the effectiveness of the com- 
pensator. The effectiveness of the closed-loop system can 
be evaluated by the relative suppression of the closed-loop 
output when compared to the open-loop output. Thus, the 
ratio of variances of the closed-loop output and the open- 
loop output excited by the white noise input is an appro- 
priate measure of the suppression (alias the compensator 
performance). It will be shown later that the suppression 
depends on pole mobility in the complex plane. Therefore, 
if a particular pair of poles is easily moved (i.e., in the case 
when small weight is required), the respective states are 
easy to control and to estimate. On the contrary, if a par- 
ticular pair of poles is difficult to move (i.e., a large weight 
is required to move the poles), the respective states are 
difficult to control and to estimate. In the latter case, the 
action of the compensator is irrelevant, and the states that 
are difficult to control and estimate can be reduced; thus, 
pole mobility is a good indicator of the importance of a 
particular compensator state. 

Consider an LQG balanced system, and denote the pole
mobility index $\pi_i$ by

$\pi_i = 0.5(\beta_i - 1)$    (29)

Note that for $\beta_i = 1$, the $i$th pole is stationary and $\pi_i$ is
equal to zero; for a shifted pole, one obtains $\beta_i > 1$ and
$\pi_i > 0$. If for small $\pi_i$ a small pole shift (in plant, as well as
in estimator) is observed, this component can be reduced.

Another useful interpretation follows from Eqs. (29)
and (26):

$\pi_i = 0.5(\gamma_{oi} - \gamma_{ci})/\gamma_{ci} = 0.5(\sigma_{oi}^2 - \sigma_{ci}^2)/\sigma_{ci}^2$    (30)

i.e., the pole mobility index is proportional to the relative
change in the white noise response of the open- and closed-loop
systems. Furthermore, from Eqs. (21) and (29), it can
be shown that the pole mobility index is a product of the
square of a Hankel singular value and the characteristic
value of the system:

$\pi_i = \gamma_i^2 \mu_i$    (31)

Thus, $\pi_i$ combines the system observability and controllability
properties of the open-loop system with the closed-loop
performance. The more heavily weighted the component,
the larger its pole mobility index, see Fig. 8(a).
Also, the larger the Hankel singular value of the component,
the larger the corresponding pole mobility index, cf.
Fig. 8(b).

The matrix $\Pi$ of pole mobility indices is defined as

$\Pi = \mathrm{diag}(\pi_1, \pi_2, \ldots, \pi_{n-1}, \pi_n)$    (32)

and it is obtained from Eqs. (15) and (29) as

$\Pi = \Gamma^2 M$    (33)

In the following, a reduction technique is discussed. Assume
$\Pi$ in Eq. (32) has a descending order, i.e., $\pi_i > 0$,
$\pi_{i+1} \le \pi_i$, $i = 1, \ldots, n$, and divide $\Pi$ as follows:

$\Pi = \mathrm{diag}(\Pi_r, \Pi_t)$    (34)

where $\Pi_r$ consists of the first $k$ entries of $\Pi$, and $\Pi_t$ the
remaining ones. If the entries of $\Pi_t$ are small in comparison
with the entries of $\Pi_r$, the compensator is reduced by
truncating its last $n - k$ states.


It is shown in [11] that a system with the reduced-order
compensator obtained by reducing states with small $\Pi_t$ is
expected to be stable. That is, although it is not guaranteed,
there is a well-founded expectation to obtain a
stable system with the reduced-order compensator. Also,
the estimation errors of the full-order and reduced-order
compensators are approximately the same.
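
The selection of components by pole mobility index can be sketched as follows. The index is computed from Eqs. (21) and (29); the relative cutoff `rel_tol` is an assumption introduced here only to make "small in comparison" concrete, and the three modes are hypothetical values, not DSS-13 data.

```python
import math


def pole_mobility(q, gamma, zeta, omega):
    """pi_i = 0.5*(beta_i - 1), Eq. (29), with beta_i from Eq. (21)."""
    beta = math.sqrt(1.0 + 2.0 * q * gamma**2 / (zeta * omega))
    return 0.5 * (beta - 1.0)


def select_components(modes, rel_tol=0.05):
    """Rank components by descending pi_i and keep the significant ones.

    modes   : list of (q, gamma, zeta, omega) tuples, one per component
    rel_tol : hypothetical cutoff relative to the largest index
    Returns (kept indices in descending order of pi_i, all indices pi_i).
    """
    indices = [pole_mobility(*mode) for mode in modes]
    order = sorted(range(len(modes)), key=lambda i: indices[i], reverse=True)
    cutoff = rel_tol * indices[order[0]]
    kept = [i for i in order if indices[i] >= cutoff]
    return kept, indices


# Three hypothetical components sharing the same weight.
modes = [
    (1e-7, 1000.0, 0.01, 14.6),
    (1e-7, 10.0, 0.01, 30.0),
    (1e-7, 500.0, 0.02, 20.0),
]
kept, indices = select_components(modes)
```

Here the second component has a small Hankel singular value, hence a nearly stationary pole, and it is the one truncated.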


V. Closed-Loop System 

The LQG compensator configuration for the DSN antenna
is shown in Fig. 1. The tracking command $y_c$ is
compared with the estimated antenna position $\hat{y}_p$, and the
error $e = \hat{y}_p - y_c$, along with the integral $e_i$ of the error
$e$, are the plant inputs. The equations for the integrator,
plant, and the estimator are, respectively,

$\dot{e}_i = e$    (35a)

$\dot{x} = Ax + Bu, \quad y_p = C_p x, \quad x_f = C_f x$    (35b)

$\dot{\hat{x}} = A\hat{x} + Bu + K_e(y_p - C_p\hat{x})$    (35c)

with

$u = -K_f\hat{x}_f - K_p e - K_i e_i$

Denoting $x_{cl}^T = [e_i^T \; x^T \; \hat{x}^T]$, one obtains

$\dot{x}_{cl} = A_{cl}x_{cl} + B_{cl}y_c, \quad y = C_{cl}x_{cl}$    (36a)

where

$A_{cl} = \begin{bmatrix} 0 & 0 & C_p \\ -BK_i & A & -BK_fC_f - BK_pC_p \\ -BK_i & K_eC_p & A - BK_fC_f - BK_pC_p - K_eC_p \end{bmatrix}, \quad B_{cl} = \begin{bmatrix} -I \\ BK_p \\ BK_p \end{bmatrix}$    (36b)


The triple $(A_{cl}, B_{cl}, C_{cl})$ for the LQG system with the reduced-order estimator $(A_r, B_r, C_r)$ is as follows:

$A_{cl} = \begin{bmatrix} 0 & 0 & C_{pr} \\ -BK_i & A & -BK_fC_{fr} - BK_pC_{pr} \\ -B_rK_i & K_{er}C_p & A_r - B_rK_fC_{fr} - B_rK_pC_{pr} - K_{er}C_{pr} \end{bmatrix}, \quad B_{cl} = \begin{bmatrix} -I \\ BK_p \\ B_rK_p \end{bmatrix}$    (37)

where $C_{pr}$ and $C_{fr}$ are obtained from the partition of $C_r$:

$C_r^T = [C_{pr}^T \; C_{fr}^T]$


VI. Compensator Design Algorithm

Weights for the balanced LQG controller and estimator
are identical. Therefore, in the algorithm, the controller
and estimator gains are adjusted simultaneously. The procedure
for the antenna LQG compensator design is a sequential
one. First, for tracking subsystem weights chosen ad hoc
(but relatively small), the flexible subsystem
weights are determined (recall that the controller and estimator
weights are the same). Second, the adjustment
of the tracking subsystem weights is performed, followed by
a final adjustment of the flexible subsystem weights. The flexible
subsystem weights are determined sequentially, which
gives more insight into the system performance and simplifies
the procedure.


The estimator order is determined as a part of the
weight tuning process. Only the modes with large index
$\pi_i$ are considered. If the number of flexible modes is $n_f$,
the number of disregarded modes is $n_o$, and the size of
the tracking system is $n_t$, then the controller order $n_c$ is
$n_c = n_t + 2(n_f - n_o)$. The following LQG compensator
design algorithm is proposed:

(1) Determine the plant state-space representation in
the form of Eq. (8), consisting of flexible and tracking
parts.

(2) Choose ad hoc but reasonably small weights and
variances for the tracking part, $Q_t = V_t = Q_{tah}$.

(3) For each balanced coordinate of the flexible part,
choose the weight $q_i$, $i = 1, \ldots, n_f$, and define the
weight and covariance matrices $Q_{fi} =
\mathrm{diag}(0, 0, \ldots, q_i, q_i, 0, 0, \ldots, 0)$, $V_{fi} = Q_{fi}$, so that
the closed-loop system performance for the weight
$Q_i = \mathrm{diag}(Q_{tah}, Q_{fi})$ and the covariance $V_i = Q_i$ is
optimized. For example, determine the weights $q_i$ to
impose the required pole shift or to suppress the $i$th
resonant peak to the required level without degrading
other properties of the closed-loop transfer
function. Note that for small $q_i$, only the $i$th pair
of poles is shifted (to the left), and the remaining
poles are almost unaffected. Disregard the modes
with small index $\pi_i$, for which the weighting does
not improve the closed-loop system performance.

(4) For the already determined $Q_f$ and $V_f$, tune the weight
$Q_t$ and assume the covariance $V_t = Q_t$ to obtain improvements
in tracking properties of the antenna.

(5) Adjust the flexible subsystem weights, if necessary.
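
Steps (2) and (3) above can be sketched with the approximate closed-form relation of Eq. (25). This is an illustration under stated assumptions, not the design code used for the antenna: the function names are hypothetical, the per-mode limits `q_limits` stand in for the limiting weights $q_{oi}$ that a designer would find from root-locus or transfer-function plots, and the mode data are invented.

```python
def flexible_weights(modes, beta_target, q_limits):
    """Step (3) sketch: one weight per two-state flexible component.

    modes       : list of (gamma, zeta, omega) tuples, one per mode
    beta_target : required peak suppression factor (>= 1) for every mode
    q_limits    : per-mode upper bounds, playing the role of q_oi
    """
    weights = []
    for (gamma, zeta, omega), q_max in zip(modes, q_limits):
        q = 0.5 * (beta_target**2 - 1.0) * zeta * omega / gamma**2  # Eq. (25)
        weights.append(min(q, q_max))  # stay within the validity limit
    return weights


def weight_diagonal(q_track, weights):
    """Diagonal of Q = diag(Q_t, q_1 I_2, ..., q_n I_2); V = Q when balanced."""
    diagonal = list(q_track)
    for q in weights:
        diagonal.extend([q, q])  # each flexible component has two states
    return diagonal


# Hypothetical data: unit tracking weights (step 2) and two flexible modes.
modes = [(1000.0, 0.01, 10.0), (10.0, 0.01, 10.0)]
weights = flexible_weights(modes, beta_target=2.0, q_limits=[1e-3, 1e-3])
q_diag = weight_diagonal([1.0, 1.0, 1.0, 1.0], weights)
```

The weakly controllable second mode would need a weight above its limit to reach the target, so the weight is capped; in the procedure above such a mode is a candidate for being disregarded in the compensator.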

VII. Applications 

A balanced LQG compensator was designed for the
DSS-13 antenna. The DSS-13 antenna model consists of
two tracking states (azimuth and elevation angle) and 13
flexible modes (or 26 balanced states). The preliminary
weights $q_{ie} = q_{pe} = q_{ia} = q_{pa} = 1$ for the tracking subsystem
(for $y_i$ and $y_p$) and zero weights for the flexible
subsystem ($q_1 = q_2 = \ldots = q_{13} = 0$ for all 13 modes)
were chosen. The closed-loop system step response is presented
in Fig. 9 (azimuth encoder reading due to azimuth
command) and the magnitudes of the closed-loop transfer
function in Fig. 10. Both figures show that flexible
motion of the antenna is excessive and should be damped
out. This is achieved by adjusting weights for the flexible
subsystem. For the tracking weights as before, the weight
for the first mode (2.32 Hz) is chosen to be $q_1 = 10^{-7}$, and
the remaining weights are zero, obtaining the closed-loop
system responses as shown in Figs. 11 and 12. One can see
that the 2.32-Hz resonance peak in the azimuth command
response (Fig. 11) has disappeared, along with most of the
flexible motion in the azimuth step response (Fig. 12). The
elevation motion is unaffected, however, since the azimuth
mode is almost nonexistent in the elevation motion.

The weight should be chosen carefully. A weight that is
too small (e.g., $3 \times 10^{-9}$ in the case considered) will not
suppress the resonant peak, Fig. 13(a). A weight that is
too large (e.g., $10^{-5}$) will deteriorate the tracking performance:
for the overweighted mode the transfer function is
pressed down within a wide frequency range, Fig. 13(b).
The proper weight suppresses the resonant peak, leaving
the other peaks unchanged.


Similar procedures have been applied for the second
(2.64-Hz), third (4.26-Hz), fourth (3.77-Hz), fifth (7.92-Hz),
sixth (4.47-Hz), seventh (3.38-Hz), eighth (5.98-Hz),
ninth (7.32-Hz), and tenth (9.48-Hz) modes, with weight
$10^{-7}$ for each mode. As a result, the suppression of the
remaining flexible motion is observed as shown in Fig. 14.
Weights for the remaining modes (eleventh through thirteenth)
have been set to zero.

The root locus of the closed-loop system due to weight 
variations of the 7.92-Hz (fifth) mode is shown in Fig. 15. 
The figure shows the horizontal departure of poles to the 
left (stabilizing property). It confirms the properties of 
the weighted LQG design described previously. 

In the next step, the tracking properties of the system
are improved by proper weight setting of the tracking subsystem.
Namely, setting the integral weight to $q_{ie} = q_{ia} =
70$ and the proportional weight to $q_{pe} = q_{pa} = 100$ improves
the system tracking properties, as shown in Fig. 16
(small overshoot and settling time) and in Fig. 17 (extended
bandwidth up to 2 Hz). However, by improving the
tracking properties, the transfer function has been raised
dramatically in the frequency region of 1 to 3 Hz, which
forces the first two modes located in this region to appear
again in the step response. By sacrificing a bit of
the tracking properties, the flexible motion in the step response
is removed. This is done by slightly increasing the
weights of the flexible subsystem, setting them as follows:
$q_1 = q_2 = q_3 = q_4 = q_5 = q_6 = 10^{-6}$, $q_7 = q_8 = 10^{-7}$, and
$q_9 = q_{10} = 10^{-5}$. The closed-loop system response with
satisfactory tracking performance is shown in Figs. 18 and
19 (small overshoot, settling time, and 1-Hz bandwidth
are observed).

The reduced-order compensator is obtained through
evaluation of pole mobility indices $\pi_i$. The plot of $\pi_i$ is
shown in Fig. 20. Reducing the order of the estimator
to 12 states [the first four are tracking states (states 1 to
4), the next six are flexible states (states 5 to 10), and
the last two states are nonflexible components of the rate
loop model (states 25 and 27)] gives a stable and accurate
closed-loop system. The reduced-order compensator
is compared with the full-order compensator in the step
response plots in Fig. 21, and the transfer function plots
in Fig. 22, showing satisfactory approximation.

VIII. Conclusions 

A method for designing a reduced-order compensator
for the DSS-13 antenna has been presented. A balanced
LQG compensator has been introduced that uses the same
amount of effort to control and to estimate the system.
The properties of the balanced LQG system are used
to obtain a reduced-order compensator for the antenna.
This compensator preserves the stability and performance
of the full-order compensator. The performance of the
reduced-order compensator has been verified by simulations.

Fig. 3. Magnitudes of transfer function of tracking and flexible subsystems for the elevation
rate input.



Fig. 4. Magnitudes of transfer function of the closed loop system for different weights of 
elevation angle to elevation command. 













POLE SHIFT COEFFICIENT p 



Fig. 7. Pole shift factor /3 versus: (a) weight q and (b) Hankel singular value 









ELEVATION AND AZIMUTH ENCODER ANGLE 



Fig. 9. Azimuth encoder response to azimuth step command for the unit proportional and in- 
tegral weights in azimuth and elevation, and zero weight for flexible subsystem. 



Fig. 10. Transfer function (elevation and azimuth encoder due to azimuth command) for 
weights the same as those in Fig. 9. 
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AZIMUTH ENCODER ANGLE 



FREQUENCY, Hz 

Fig. 11. Transfer function (elevation and azimuth encoder to azimuth command) for weights 
the same as those In Fig. 9, but for q ft = 10 -7 . 



TIME, sec 


Fig. 12. Azimuth encoder response to azimuth step command for weights the same 

as those In Fig. 11. 



MAGNITUDE, AZIMUTH ENCODER ANGLE MAGNITUDE, AZIMUTH ENCODER ANGLE 



10” 2 10 _1 10° 10 1 10 2 

FREQUENCY, Hz 

Fig. 13. The first component: (a) underweighted and (b) overweighted. 
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IMAGINARY PART AZIMUTH ENCODER ANGLI 



TIME, sec 


Fig. 14. Azimuth encoder response to azimuth step command for weights the same as 
those In Fig. 9, but for g/i = ... = gfio = 10~ 7 . 



Fig. 15. Root locus for 7.92-Hz mode versus weight. 



ELEVATION AND AZIMUTH ENCODER ANGLE 



Fig. 16. Closed-loop response to step Input for proportional weight 100 and integral weight 
70 (both in azimuth and elevation), and weights for flexible subsystem equal to 10 -7 (azimuth 
encoder to azimuth command). 



Fig. 17. Closed-loop transfer function for weights the same as those In Fig. 16 (elevation en- 
coder and azimuth encoder to azimuth command). 
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ELEVATION AND AZIMUTH ENCODER ANGLE js =• -n AZIMUTH ENCODER ANGLE 


TIME, sec 


jg, 18. Azimuth encoder response to azimuth step command for proportional weight 100 
itegral weight 70 (In azimuth and elevation), and for flexible weights gn = ... = g/e = 10 “ 
77 = g/g = 10“ 7 , and q f g - qno = 10” 5 * 


- ELEVATION ENCODER 

- AZIMUTH ENCODER 



FREQUENCY, Hz 


Fig. 19. Closed-loop transfer functions for weights the same as those In Fig. 18 (elevation 
encoder and azimuth encoder to azimuth command). 




[Figure: azimuth encoder angle.]

Fig. 22. Transfer function (azimuth encoder to azimuth command) of the DSS-13 antenna with
full- and reduced-order compensators.
Appendix

I. Selected Properties of Flexible Systems

The balanced grammian for a flexible system with n components (or N = 2n states) has the following form [7,8]:

$$\Gamma \cong \mathrm{diag}(\gamma_1, \gamma_1, \gamma_2, \gamma_2, \ldots, \gamma_n, \gamma_n) \tag{A-1}$$

The system matrix A is almost block diagonal, with dominant 2 x 2 blocks on the main diagonal:

$$A \cong \mathrm{diag}(A_i), \qquad A_i = \begin{bmatrix} -\zeta_i\omega_i & -\omega_i \\ \omega_i & -\zeta_i\omega_i \end{bmatrix}, \qquad i = 1, \ldots, n \tag{A-2}$$

where $\omega_i$ is the ith natural frequency of the structure, and $\zeta_i$ is the ith modal damping. The matrices B and C are divided into blocks, comparably to A, $B^T = [B_1^T, B_2^T, \ldots, B_n^T]$, and $C = [C_1, C_2, \ldots, C_n]$, with the following property:

$$B_i B_i^T \cong C_i^T C_i \cong -\gamma_i^2 (A_i + A_i^T) \tag{A-3}$$

II. Proof of Eq. (17)

Due to the diagonally dominant matrix A for a flexible structure in balanced representation, and for Q as in Eq. (16), there exists $q_i \le q_{oi}$, $i = 1, \ldots, n$, such that the solution S of the Riccati Eq. (4) is also diagonally dominant with 2 x 2 blocks $S_i$ on the main diagonal:

$$S_i \cong s_i I_2, \qquad s_i > 0, \qquad i = 1, \ldots, n \tag{A-4}$$

Thus, Eq. (4) turns into a set of the following equations:

$$s_i (A_i + A_i^T) - s_i^2 B_i B_i^T + q_i I_2 \cong 0, \qquad i = 1, \ldots, n \tag{A-5}$$

For a balanced system $B_i B_i^T \cong -\gamma_i^2 (A_i + A_i^T)$ and $A_i + A_i^T = -2\zeta_i\omega_i I_2$, see Eqs. (A-3) and (A-2), respectively. Therefore, Eq. (A-5) is now

$$s_i^2 + \frac{s_i}{\gamma_i^2} - \frac{0.5\, q_i}{\zeta_i \omega_i \gamma_i^2} \cong 0, \qquad i = 1, \ldots, n \tag{A-6}$$

There are two solutions of Eq. (A-6), but for a stable system and for $q_i = 0$ it is required that $s_i = 0$; therefore, Eq. (17) is the unique solution of Eq. (A-6).

III. Proof of Eq. (23)

For small $q_i$, the matrix $A_o$ of the closed-loop system is diagonally dominant, $A_o \cong \mathrm{diag}(A_{oi})$, $i = 1, \ldots, n$, and $A_{oi} = A_i - B_i B_i^T S_i$. Introducing Eqs. (A-3) and (A-4), one obtains

$$A_{oi} \cong A_i + s_i \gamma_i^2 (A_i + A_i^T) \tag{A-7}$$

and introducing $A_i$ as in Eq. (A-2) to Eq. (A-7) one obtains

$$A_{oi} \cong \begin{bmatrix} -\beta_{pi}\zeta_i\omega_i & -\omega_i \\ \omega_i & -\beta_{pi}\zeta_i\omega_i \end{bmatrix} \tag{A-8}$$

with $\beta_{pi}$ as in Eq. (17).

IV. Proof of Eqs. (26a) and (26b)

In order to prove Eqs. (26a) and (26b), the closed-loop Lyapunov equation is considered:

$$(A - BB^T S)\,\Gamma_c^2 + \Gamma_c^2\,(A - BB^T S)^T + BB^T \cong 0 \tag{A-9a}$$

or, for the ith pair of variables,

$$(A_i - B_i B_i^T s_i)\,\gamma_{ci}^2 + \gamma_{ci}^2\,(A_i - B_i B_i^T s_i)^T + B_i B_i^T \cong 0 \tag{A-9b}$$

Introducing Eq. (A-3) gives

$$\gamma_{ci}^2 + 2\gamma_{ci}^2\gamma_{oi}^2 s_i - \gamma_{oi}^2 \cong 0 \tag{A-10}$$

or

$$\frac{\gamma_{oi}^2}{\gamma_{ci}^2} \cong 1 + 2 s_i \gamma_{oi}^2 \tag{A-11}$$

Comparing Eq. (A-11) and Eq. (17) gives Eq. (26a).
Performance Results of a 300-Degree Linear Phase
Modulator for Spaceborne Communications
Applications

N. R. Mysoor and R. O. Mueller 

Spacecraft Telecommunications Equipment Section 


A phase modulator capable of large linear phase deviation, low loss, and wide-
band operation with good thermal stability has been developed for deep space space-
craft transponder (DST) applications at X-band (8.415 GHz) and Ka-band (32 GHz)
downlinks. The design uses a two-stage circulator-coupled reflection phase shifter
with constant gamma hyperabrupt varactors and an efficient modulator driver cir-
cuit to obtain a phase deviation of ±2.5 rad with better than 8 percent linearity.
The measured insertion loss is 6.6 dB ±0.35 dB at 8415 MHz. Measured carrier
and relative sideband amplitudes resulting from phase modulation by sine wave and
square modulating functions agree well with the predicted results.


I. Introduction 

The telecommunication transponders [1] for Cassini and 
other future deep space missions require an X-band linear 
phase modulator. The design is to provide ±2.5 rad of 
peak phase deviation to accommodate downlink modula- 
tion of telemetry and ranging signals. The tolerance on the 
phase deviation linearity is 8 percent. The phase modula- 
tor design specifications [2] for the deep space transponder 
applications are listed in Table 1. The modulator design 
concepts, development, analysis, and measured results are 
presented in this article. The description of the bread- 
board phase modulator is presented in Section II, the test 
data and analysis are presented in Section III, and the 
conclusions are presented in Section IV. 


II. Description of the Phase Modulator 

The X-band phase modulator is designed as a cascaded 
two-stage circulator-coupled reflection phase shifter [2,3,4] 
with constant gamma hyperabrupt varactors. The gallium 
arsenide (GaAs) varactor diode is well suited for the phase 
modulator application as it can provide rapid phase change 
with the applied voltage. The diode capacity parame-
ter, gamma ($\Gamma$), is the slope of the capacitance-voltage
curve as plotted on log-log paper. The gamma of a typical
hyperabrupt varactor varies widely with applied voltage, 
which makes these varactors unsuitable for linear phase- 
modulator applications. Recent device processing develop- 
ments [4] have enabled the construction of varactors with 
gammas that remain constant over a limited voltage bias 
range. The GaAs varactors¹ used in this investigation have
a constant gamma equal to 1.5. The varactor diode is usu-
ally modeled as a junction capacitance in series with the
resistance of the GaAs substrate and epitaxial layers. The
junction capacitance of varactor diodes is modeled as

$$\frac{C_j(V)}{C_0} = \left[1 + \frac{V}{\Phi}\right]^{-\Gamma} \tag{1}$$

where $C_j(V)$ is the junction capacitance at reverse bias
voltage, V. The value $C_0$ is the junction capacitance at
V = 0, and $\Phi$ is the built-in potential, which is equal to
1.3 V for GaAs.

The series resistance (Rs) of varactor diodes can be 
determined from the measured diode’s quality factor, Q. 
The Q factor is a measure of loss in the varactor diode. 
The expression for Q as a function of the bias voltage, 
used in the simulation, was obtained by curve fitting the 
manufacturer’s (Microwave Associates’) data at 50 MHz: 

$$Q(V) = 1538 - 115.6|V| + 118.2|V|^2 + 21.96|V|^3 - 1.683|V|^4 \tag{2}$$

The diode series resistance is then given by

$$R_s(V) = \left[2\pi \times 50 \times 10^6 \times Q \times C_j(V)\right]^{-1} \tag{3}$$
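
The diode model of Eqs. (1) through (3) can be evaluated numerically. The following is a minimal sketch, not the authors' simulation code: the function names are hypothetical, and the zero-bias capacitance is back-computed on the assumption that the 0.65-pF value quoted later in this section for a bias of -4 V is the junction capacitance alone (package parasitics excluded).

```python
import math

GAMMA = 1.5      # constant gamma of the varactors (from the text)
PHI = 1.3        # built-in potential for GaAs, volts (from the text)
F_Q = 50e6       # frequency of the manufacturer's Q data, Hz (from the text)

# Assumption: the 0.65 pF quoted at -4 V is the junction capacitance alone.
C_AT_4V = 0.65e-12
C0 = C_AT_4V * (1.0 + 4.0 / PHI) ** GAMMA   # zero-bias capacitance implied by Eq. (1)


def junction_capacitance(v_reverse):
    """Eq. (1): C_j(V) = C0 * (1 + V/PHI)**(-GAMMA); V is the reverse-bias magnitude."""
    return C0 * (1.0 + abs(v_reverse) / PHI) ** (-GAMMA)


def quality_factor(v_reverse):
    """Eq. (2): curve fit of the diode Q at 50 MHz."""
    v = abs(v_reverse)
    return 1538 - 115.6 * v + 118.2 * v**2 + 21.96 * v**3 - 1.683 * v**4


def series_resistance(v_reverse):
    """Eq. (3): R_s = 1 / (2*pi*f*Q*C_j), evaluated at f = 50 MHz."""
    q = quality_factor(v_reverse)
    return 1.0 / (2.0 * math.pi * F_Q * q * junction_capacitance(v_reverse))


if __name__ == "__main__":
    for v in (1.5, 5.0, 8.5):   # ends and center of a 5 +/- 3.5 V bias range
        print(f"V = {v:4.1f} V  Cj = {junction_capacitance(v) * 1e12:.3f} pF  "
              f"Q = {quality_factor(v):7.1f}  Rs = {series_resistance(v):.3f} ohm")
```

Under these assumptions the series resistance comes out on the order of 1 ohm across the bias range; the exact figures depend on the assumed zero-bias capacitance.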

A phase modulator consisting of a two-section reflection 
phase shifter [2] and a driver circuit was breadboarded and 
tested. The phase modulator photograph and schematic 
are illustrated in Figs. 1 and 2, respectively. The module is 
made out of aluminum alloy 6061, which is tin-lead plated 
to ensure ease of soldering and good electrical conductivity. 
The size of the module is 81 x 61 x 14.3 mm. The two- 
section phase shifter layout is etched on a 0.508-mm-thick 
RT/Duroid 6002 soft substrate of a dielectric constant 
equal to 2.94. The constant 1.5 gamma varactors² are
mounted with a contact strip at the end of an optimized
30-ohm, 2.794-mm-wide transformer. The diode capaci-
tance at -4 V is equal to 0.65 pF. The diode package par-
asitic capacitance and inductance are equal to 0.23 pF and
0.20 nH, respectively. TRAK Microwave 6.35-mm micro-
puck circulators [79*9001] and isolators [89*9001] are used
in this breadboard. The driver circuit is etched on 0.787-
mm-thick RT/Duroid 5880, which has a dielectric constant
of 2.2. The phase modulator driver circuit schematic is
shown in Fig. 3. The functions of the phase modulator
drive circuit are to sum and amplify the modulation in-
put signals and to provide composite drive voltage to the
varactor diodes. The modulation input signals include
the spacecraft telemetry, ranging, and differential one-way
ranging (DOR) signals. The modulation frequency range
is from 1 kHz to 20 MHz. The selected wideband op-amp
for this application is Comlinear CLC 505.

¹ The varactors were manufactured by Microwave Associates,
MA-4641 1-126.

² Ibid.


III. Experimental Results 

The measured values of phase deviation and inser- 
tion loss for the phase modulator are compared with the 
predicted results in Fig. 4. The predicted results were
obtained by microwave computer-assisted-design (CAD)
simulation of an extensive circuit model of
the circuit schematic shown in Fig. 2. The measurements
were conducted using a test RF signal level of +9 dBm. 
The insertion loss for this unit is 6.35 ±0.25 dB, the phase 
deviation is ±150 deg, with a linearity better than ±8 per- 
cent of a best-fitted straight line (BSL). The varactor bias 
voltage range is 5 ±3.5 V. The model accurately predicted 
a linear phase slope of 42 deg/V. The predicted insertion 
loss, including circulator and isolator losses, is 7.0 ±0.4 dB, 
which is about 0.65 dB higher than the measured values. 
The discrepancy in the insertion loss is most likely due to 
the uncertainty in the measured insertion loss of the circu- 
lators and isolators. The insertion loss of these devices was 
measured in a separate test fixture. Circulators measured 
in this test fixture had a port-to-port one-way insertion 
loss of 1 ±0.1 dB. The test fixture is not enclosed and, 
therefore, has radiation losses which the module does not 
have. Also, circulators were not epoxied in the test fixture 
as they are in the module; this will result in higher losses 
since there is not a solid ground contact. The measured 
results of the phase modulator are compared with its de- 
sign specifications in Table 1. The phase modulator meets 
or exceeds all specifications. 

Figure 5 shows the predicted and measured gain versus
frequency characteristics of the phase modulator driver cir-
cuit at -35 deg C, +25 deg C, and +85 deg C. A separate
breadboard was used to test the driver circuit. The mea-
sured response is flat and drops only 0.4 dB from 1 kHz to
20 MHz. The measured 3-dB bandwidths at -35 deg C,
+25 deg C, and +85 deg C are 92 MHz, 84 MHz, and
75 MHz, respectively. The measured results are actually
better than the Simulation Program with Integrated Cir-
cuit Emphasis (SPICE) predicted circuit response using a
conservative circuit model for the op-amp integrated cir-
cuit. The model correctly predicted a larger bandwidth at
lower temperatures. The dc power consumption with an 
8-V peak-to-peak output swing is 112 mW; when no signal 
is applied, the power consumption is 32 mW. 

Sinusoidal and square modulating waveforms were ap- 
plied to the phase modulator, and their resulting spectra 
[5,6] were monitored on a calibrated spectrum analyzer. 
All measurements were performed at 25 deg C and with a 
carrier frequency of 8415 MHz. Figure 6 shows the result- 
ing spectrum when a sinusoidal modulating wave was ap- 
plied to the phase modulator. The modulating frequency 
used is 1 MHz, and the peak modulation index is 2.4 rad. 

A comparison of measured and predicted carrier, first, 
second, and third sideband levels for the case of sinusoidal 
modulation is shown in Fig. 7. A modulation frequency of 
10 MHz was used in the measurements. The peak phase-
modulation index, $\beta$, ranges from 0.2 to 2.4 rad in 0.2-rad
steps. The predicted relative carrier and sideband levels [6]
were computed by evaluating the appropriate Bessel func-
tions of the first kind of order n, $J_n(\beta)$. The subscript n is
an integer. It represents the carrier for n equal to zero and
sidebands for n not equal to zero. The relative carrier level
for a given modulation index is determined by evaluating
$20 \log[|J_0(\beta)|]$. Similarly, the relative sideband levels may
be determined by evaluating $20 \log[|J_n(\beta)|]$ for $|n| > 0$.
Figure 7 shows excellent agreement between theory and 
measurement for sinusoidal phase-modulating waves. Neg- 
ligible amplitude modulation distortion was observed in 
this case. 
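
The predicted levels for sinusoidal modulation can be reproduced with a few lines of code. The sketch below is illustrative only and is not the authors' program: it evaluates the Bessel functions from Bessel's integral with the trapezoidal rule so that it needs only the standard library, and the function names are hypothetical.

```python
import math


def bessel_j(n, x, steps=400):
    """J_n(x) from Bessel's integral (1/pi) * int_0^pi cos(n*t - x*sin(t)) dt,
    evaluated with the trapezoidal rule."""
    h = math.pi / steps
    # End points t = 0 and t = pi, each with weight 1/2.
    total = 0.5 * (math.cos(0.0) + math.cos(n * math.pi))
    for k in range(1, steps):
        t = k * h
        total += math.cos(n * t - x * math.sin(t))
    return total * h / math.pi


def relative_level_db(n, beta):
    """Level of the n-th spectral line relative to the unmodulated carrier, in dB."""
    return 20.0 * math.log10(abs(bessel_j(n, beta)))


if __name__ == "__main__":
    # Carrier (n = 0) and first three sidebands for beta = 0.2 to 2.4 rad in 0.2-rad steps.
    for i in range(1, 13):
        beta = 0.2 * i
        levels = "  ".join(f"{relative_level_db(n, beta):8.2f}" for n in range(4))
        print(f"beta = {beta:3.1f} rad: {levels}")
```

The carrier level drops sharply near 2.4 rad because the first zero of the zeroth-order Bessel function lies close to that index.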

Figure 8 shows the spectrum that results when a square 
modulating wave at a frequency of 100 kHz and a peak 
modulation index of 1.571 rad is applied to the phase mod- 
ulator. A comparison of measured and predicted carrier, 
first, third, and fifth sideband levels for the case of square- 
wave modulation is shown in Fig. 9. A modulation fre- 
quency of 100 kHz was used in the measurements. The 
relative carrier level in decibels is determined [6] by evalu-
ating $20 \log[|\cos(\beta)|]$, where $\beta$ is the peak modulation in-
dex. The relative sideband levels may be determined [6] by
evaluating $20 \log[|(2/n\pi) \sin(\beta)|]$, where n is the number
of the sideband. Excellent agreement between predicted
and measured square-wave modulation results is demon- 
strated in Fig. 9. 
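
The square-wave expressions above are simple enough to tabulate directly. The following sketch (hypothetical function name, odd sidebands only, as in Fig. 9) shows the calculation.

```python
import math


def square_wave_levels_db(beta, sidebands=(1, 3, 5)):
    """Carrier and odd-sideband levels, in dB relative to the unmodulated carrier,
    for square-wave phase modulation with peak index beta (radians)."""
    def db(amplitude):
        return 20.0 * math.log10(amplitude) if amplitude > 0.0 else float("-inf")

    levels = {0: db(abs(math.cos(beta)))}          # carrier: |cos(beta)|
    for n in sidebands:
        # n-th sideband: |(2 / (n*pi)) * sin(beta)|
        levels[n] = db(abs(2.0 / (n * math.pi) * math.sin(beta)))
    return levels


if __name__ == "__main__":
    for beta in (0.5, 1.0, math.pi / 2):
        row = "  ".join(f"n={n}: {lvl:8.2f} dB"
                        for n, lvl in square_wave_levels_db(beta).items())
        print(f"beta = {beta:.3f} rad  {row}")
```

At a peak index of 1.571 rad the carrier term is essentially suppressed and the first sideband sits about 3.9 dB below the unmodulated carrier.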

The data illustrated in Figs. 7 and 9 were obtained by 
the judicious use of a spectrum analyzer. For small mod- 
ulation indices, setting the first sideband level is the most 
accurate way [5] to obtain a particular value of a modula- 
tion index. For example, if a modulation index of 0.2 rad 
(sinusoidal modulation) is needed, then the modulation 
power level should be adjusted until the first sideband level 
falls 20.0 dB below the unmodulated carrier level. The er- 
ror in the measurement is determined by the accuracy of 
the spectrum analyzer. If the above reading is accurate to 
within 1 dB, then the modulation index would be accurate 
to within 0.03 rad. For larger modulation indices, the first 
sideband will give large errors. In this case, the carrier 
level should be used to set the modulation index. The de- 
cision of whether to use the carrier or the first sideband 
level can, in general, be made by choosing the one that has 
the steepest slope [5] near the desired modulation index. 
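
As a worked check of the 0.2-rad example above, assume the small-argument approximation $J_1(\beta) \approx \beta/2$ (an approximation introduced here for illustration, not taken from the article). The first sideband level $L_1$, in dB relative to the unmodulated carrier, can then be inverted for the modulation index:

$$L_1 \approx 20 \log_{10}(\beta/2) \quad\Longrightarrow\quad \beta \approx 2 \times 10^{L_1/20}$$

$$L_1 = -20\ \mathrm{dB}: \ \beta \approx 0.200, \qquad L_1 = -19\ \mathrm{dB}: \ \beta \approx 0.224, \qquad L_1 = -21\ \mathrm{dB}: \ \beta \approx 0.178$$

Under this approximation, a 1-dB reading error shifts the index by a little over 0.02 rad, which is consistent with the 0.03-rad bound quoted above.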

IV. Conclusions 

A two-section analog X-band reflection phase modu- 
lator with an efficient driver circuit was developed. The 
phase modulator performance is accurately predicted by 
the theoretical modeling and simulations. By using hy-
perabrupt junction GaAs varactor diodes of a constant
gamma of 1.5, a full ±2.5-rad deviation phase modulator
was realized by cascading two phase shifter circuits. An
efficient phase modulator driver circuit was also developed 
and tested. The measured gain-frequency characteristics 
of the modulator driver circuit over the design temper- 
ature range agreed well with the predicted results. The 
modulator driver 3-dB bandwidth and dc power consump- 
tion at 25 deg C are 84 MHz and 112 mW, respectively. 
At 8415 MHz, the measured phase shift and insertion loss 
of the phase modulator are equal to ±2.5 rad, with better 
than ±8 percent linearity, and 6.35 ±0.25 dB, respectively. 
Measured carrier and sideband spectra showed excellent 
agreement with theory with negligible amplitude modula-
tion distortion for both sinusoidal and square phase mod-
ulating waves.
Table 1. Measured results and design specifications for the
phase modulator.

Parameter                            Specification               Measurement (25 deg C)
-----------------------------------  --------------------------  ----------------------
Insertion loss                       10 dB, max                  6.6 dB, max
Amplitude modulation                 ±0.5 dB, max                +0.15/-0.35 dB
Input return loss                    14 dB, min                  25 dB (8415 ±50 MHz)
Output return loss                   14 dB, min                  33 dB (8415 ±50 MHz)
RF input level                       +10 dBm, max                +10 dBm, max
  (for linear operation)
Linearity                            ±5 percent                  +4.8/-3.1 percent
                                       (best straight line)
                                       to ±2 rad
Modulation sensitivity               2 ±0.2 rad/V                2.1 rad/V
Modulation input                     ±2.5 V, max                 ±1.28 V, max
  signal level at ±2.5 rad
Op-amp gain                          -                           2.8 V/V
RF modulation bandwidth
  -0.25 dB bandwidth                 20 MHz, min                 28 MHz
  -3 dB bandwidth                    82 MHz, min                 160 MHz
DC power (at 20 MHz
  and ±2.5 rad)
  +5 V                               25 mA, max                  7.0 mA
  -5 V                               25 mA, max                  6.9 mA
Modulation voltage range                                         -1.5 to -8.5 V
  (to varactors)
Fig. 2. DST X-band phase modulator schematic.

Fig. 3. DST phase modulator driver circuit schematic.

Fig. 4. Measured and predicted results for the phase modulator.

Fig. 5. Measured and predicted gain versus frequency for the
phase modulator driver circuit at temperatures of -35, 25, and
85 deg C.

[Figure: spectrum versus frequency, MHz.]

Fig. 6. Measured phase-modulated spectrum for the case of a
sinusoidal modulating wave at a frequency of 1 MHz and a peak
modulation index of 2.4 rad.

Fig. 7. Measured and predicted carrier, first, second, and third
sideband levels versus peak modulation index for the case of a
sinusoidal modulating wave at a frequency of 10 MHz.

Fig. 8. Measured phase-modulated spectrum for the case of a
square modulation wave at a frequency of 100 kHz and a peak
modulation index of 1.571 rad.

[Figure: levels versus peak modulation index, deg.]

Fig. 9. Measured and predicted carrier, first, third, and fifth side-
band levels versus peak modulation index for the case of a square
modulating wave at a frequency of 100 kHz.




TDA Progress Report 42-112 


February 15, 1993


N93-2467I 


Management of Space Networks 
NASA has proposed missions to the Moon and Mars that reflect three areas of
emphasis: human presence, exploration, and space resource development for the
benefit of Earth. A major requirement for such missions is a robust and reliable
communications architecture. Network management, the ability to maintain some
degree of human and automatic control over the span of the network from the space
elements to the end users on Earth, is required to realize such robust and reliable
communications. This article addresses several of the architectural issues associated
with space network management.

Round-trip delays, such as the 5- to 40-min delays in the Mars case, introduce
a host of problems that must be solved by delegating significant control author-
ity to remote nodes. Therefore, management hierarchy is one of the important
architectural issues.

The following article addresses these concerns, and proposes a network man-
agement approach based on emerging standards that covers the needs for fault,
configuration, and performance management, delegated control authority, and hier-
archical reporting of events. A relatively simple approach based on standards was
demonstrated in the DSN 2000 Information Systems Laboratory, and the results
are described.


I. Introduction 

NASA has proposed missions to the Moon and Mars 
that reflect three areas of emphasis: human presence, ex-
ploration, and space resource development for the benefit
of Earth. The Moon is a natural test-bed to prepare for 
missions to Mars through simulation, systems testing, op- 
erations, and studying human capabilities. 

Communications with the Moon should be relatively 
straightforward with existing Deep Space Network (DSN) 
systems because lunar operations are initially planned to 
be centered at a main base on the near side of the Moon. 


However, a notable design issue will be communications 
from remote sites on the Moon. Such issues will increase 
the complexity of the space network. 

The report on America’s Space Exploration Initiative, 
America at the Threshold [1], describes the increased com-
plexity of Martian communications as follows:

“Providing communications for the Martian missions 
is considerably more challenging than for lunar missions. 
Mars can be as much as 1,000 times more distant from 
Earth than the Moon, which results in a spatial signal loss 
one million times greater. In addition, Mars rotates at 
about the same rate as the Earth, putting surface locations
out of direct touch for over 12 hours at a time.” 
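
The "one million times" figure in the quotation follows from inverse-square spreading of the signal. The sketch below illustrates the arithmetic; the two distances are round values assumed here for illustration and are not taken from the article.

```python
import math

# Assumed round values, for illustration only.
EARTH_MOON_KM = 384_400.0       # mean Earth-Moon distance
EARTH_MARS_MAX_KM = 401.0e6     # Earth-Mars distance near its maximum


def spreading_loss_ratio(d_far_km, d_near_km):
    """Ratio of free-space spreading loss at two ranges (inverse-square law)."""
    return (d_far_km / d_near_km) ** 2


def to_db(power_ratio):
    """Express a power ratio in decibels."""
    return 10.0 * math.log10(power_ratio)


if __name__ == "__main__":
    distance_ratio = EARTH_MARS_MAX_KM / EARTH_MOON_KM
    loss_ratio = spreading_loss_ratio(EARTH_MARS_MAX_KM, EARTH_MOON_KM)
    print(f"distance ratio: {distance_ratio:.0f}")
    print(f"loss ratio: {loss_ratio:.3g} ({to_db(loss_ratio):.1f} dB)")
```

With these assumed distances the range ratio is roughly 1,000 and the additional spreading loss is roughly 60 dB, in line with the quoted statement.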

A tentative communications architecture for Mars 
(Fig. 1) proposes Mars-synchronous relay satellites to pro- 
vide continuous coverage of surface elements as well as 
orbital elements [1]. The Martian main base may also 
have the ability to communicate directly with Earth when 
in view. Again in this case, the complexity of the space 
network will be increased as mobility and distribution of 
space elements are factored into the mission design. 

A major requirement for such a lunar and Martian com-
munications system is network management: the need to
maintain some degree of human and automatic control 
over the network to assure highly reliable and robust com- 
munications from the space elements to the end users. 

The following article addresses these issues, and pro- 
poses a network management architecture based on emerg- 
ing commercial technology that covers the needs for fault, 
configuration, and performance management, delegated 
control authority, and hierarchical reporting of events. An 
approach based on standards was demonstrated in the 
DSN 2000 Information Systems Laboratory, and the re- 
sults are described in this article. 

II. Requirements 

A. Space Management Network 

Figure 2 illustrates a simplified schematic of the pri- 
mary network to the Moon and Mars that will support 
science and human exploration. The figure illustrates the 
multiple paths between data sources and destinations, and 
the redundancy that is built into the primary architec- 
ture. For example, communications between one of the 
DSN antennas at a Deep Space Communications Complex 
(DSCC) and the Mars main base (Fig. 2(b)) may be direct 
or routed through a relay satellite. Data from the Mars 
habitat to a remote scientific instrument on Mars may be 
direct or routed through facilities on the relay satellite. 

Round-trip delays, such as the 5- to 40-minute delays in
the Mars case, introduce a host of problems that must be 
solved by delegating significant control authority to remote 
nodes. 

This article proposes a Space Management Network 
(SMN) to support transmission of management data from 
all the network elements (from the space elements to the 
end users) to Earth-based operations centers. The SMN 
is a logical network that can be distinguished from the 
primary network because its chief function is to support 


transmission of management data. The SMN may use ded- 
icated facilities or share facilities with the primary net- 
work. 

Major nodes in the SMN include the mission and sci- 
ence support centers, the DSN, the lunar main base, and 
the Mars main base. The SMN may interface to each of 
these processing end points, as well as intermediate com- 
munication facilities. 

In some ways space network management is similar to 
the management of complex Earth-based communications 
networks in which there are many types of interconnected 
networks, such as local area networks (LANs) and wide 
area networks (WANs). (In our unique network, however, 
there are also space segments.) Management of these com- 
plex configurations is an area of current research and de- 
velopment because of (1) the large numbers of nodes (in 
the thousands for some enterprises), (2) the geographic dis- 
tances involved, (3) the remoteness of much of the equip- 
ment, (4) the need for human management of the systems, 
and (5) the vision that it is now an achievable goal because 
of the recent standardization of network management pro- 
tocols. 

B. Domains 

The extent of the primary space network suggests a 
logical separation into four domains: Earth, Space, Moon, 
and Mars. Each domain may require varying degrees of se- 
curity, performance, and availability. The networks in each 
domain are summarized in Table 1. This table identifies 
the types of subnets; the actual numbers of such subnets 
are a detailed design issue that will evolve with further 
mission planning. 

1. Earth Domain. The Earth domain includes all the
networks uniting the network, mission, and science op-
erations centers. The Deep Space Network, as the likely
focus of network operations, would route video, voice, and 
data to the mission and science operations centers over do- 
mestic and international circuits. The network operations 
center could also be the primary location for integrated 
management of the end-to-end network. 

Mission operations and science operations centers are 
responsible for management of the mission and monitoring 
and analyzing scientific data. A reasonable assumption is 
that thousands of network elements would be included in 
the Earth domain. 

2. Space Domain. Moon and Mars main bases will 
be the destination end of the direct link from Earth. The 
main bases are likely to be the primary human interface 
for local network management activities. 
Remote Moon rovers and science instruments may com-
municate directly with Earth if they are beyond the line
of sight to the main base. 

A second mode of Mars-Earth communication will be 
Mars-relay-satellite to Earth. This mode will be used when 
the main base is unable to communicate to the Earth be- 
cause of Mars’ planetary rotation. These links will require 
a complex management system with redundant equipment 
and redundant command channels. 

It has been suggested that a solar activity warning sys-
tem may be necessary to protect human explorers.¹ Such
emergency data would be reported directly to the Moon 
and Mars over dedicated links. Depending on the Mars 
permanent outpost position, the data may go directly to 
the outpost or be routed through the communications re- 
lay. The data would provide timely warning of solar activ- 
ity that may pose health hazards to humans on the sur- 
face. Such solar bursts induce extreme noise on the links 
and the links themselves will require substantial fault and 
error protection. 

3. Moon Domain. Lunar surface-to-surface commu- 
nications would be used for video, voice, and data commu- 
nications. Nodes include a habitat, remote science instru- 
ments, and mobile rovers and humans involved in extrave- 
hicular activity (EVA). Locally, the links may be wire, op- 
tical fiber, or line-of-sight radio. Fault and configuration 
management are major issues with lunar communications. 

Beyond the line of sight (approximately 6 km with a 
10-m high antenna), lunar radio communications require 
a lunar surface path with intermediate radio relays, or a 
round-about path via direct Earth links. 

4. Mars Domain. Surface-to-surface communica- 
tions would be used for video, voice, and data commu- 
nications. Nodes are similar to lunar nodes with similar 
communications options. Fault and configuration manage- 
ment are also major issues with Martian communications. 

Beyond the line of sight (approximately 8 km with a 
10-m high antenna), communications would likely be over 
intermediate surface radio relays or through an orbiting 
satellite relay. Surface-to-relay-satellite communications
would provide a path not only to locations over the hori- 
zon, but to the Earth and to other orbiting spacecraft. 
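
The line-of-sight figures quoted for the Moon and Mars are consistent with a simple smooth-sphere horizon calculation. The sketch below assumes mean planetary radii (values supplied here, not given in the article), a receiver at ground level, and no terrain or diffraction effects.

```python
import math

# Assumed mean radii, in meters (not taken from the article).
MOON_RADIUS_M = 1_737_400.0
MARS_RADIUS_M = 3_389_500.0


def horizon_distance_m(body_radius_m, antenna_height_m):
    """Distance from an elevated antenna to the horizon of a smooth sphere."""
    h = antenna_height_m
    return math.sqrt(2.0 * body_radius_m * h + h * h)


if __name__ == "__main__":
    for name, radius in (("Moon", MOON_RADIUS_M), ("Mars", MARS_RADIUS_M)):
        d_km = horizon_distance_m(radius, 10.0) / 1000.0
        print(f"{name}: horizon at about {d_km:.1f} km for a 10-m antenna")
```

For a 10-m antenna this gives roughly 5.9 km on the Moon and 8.2 km on Mars, matching the approximate 6-km and 8-km values in the text.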

Relay-satellite-to-relay-satellite communications ex-
tend the routing of voice and data to Earth beyond the
limits of just one relay satellite. These links would also
require a complex network management system with re-
dundant equipment and redundant command channels to
Mars and Earth. Manned and unmanned orbiting space-
craft at Mars will have network requirements similar to
the communication relays. Manned orbiters will require
intensive communication to operations centers on Earth
and local communications to Mars.

¹ W. Kurth, personal communication, University of Iowa, January 5,
1992.

III. Network Management Technology 

It is the premise of this article that end-to-end net- 
work management will be accomplished through a struc- 
tured, evolvable management architecture based on stan- 
dards because such an architecture is likely to minimize 
life-cycle costs. Hopefully, if the proper standards are cho- 
sen, a standards-based architecture will lead to utilization 
of low-cost commercial software products. The following 
elements are essential for the description of this architec- 
ture: (1) management model, (2) protocol architecture, 
(3) connectivity, and (4) human interfaces. 

A. Management Model 

In concept, network management usually involves appli- 
cation processes called “managers” on managing systems 
and “agents” on managed systems. Current commercial 
approaches generally focus on a management hierarchy 
with three layers of control (Fig. 3): element managers, 
network managers, and an integrated network manager. 

An element manager performs management functions 
relative to that communication element and displays the 
data locally or makes the data available to a higher level 
management system. Typically network elements are man- 
aged through a software agent. The agent is devoted to 
monitoring the status and activity of the network element. 
The agent may be periodically polled by a higher level sys- 
tem or initiate urgent messages to a designated system if 
some threshold has been exceeded. If the concept were 
applied to the primary space network, examples would in- 
clude agent software in network elements of the lunar main 
base or the DSN ground systems. 

A network manager usually manages multiple elements 
on one type of network. A network manager is also con- 
cerned with managing the network circuits or channels, 
monitoring such parameters as link utilization and total 
packet rate over the medium. The area of surveillance 
could range in size from an area as small as a science data 
processing laboratory to as large as an antenna complex. 

An integrated network manager is a layer of control at 
the highest layer that can incorporate information from 
many types of networks. It typically interfaces to many
network managers. An integrated network manager could 
be used to oversee end-to-end communications on the 
Earth, in space, and on the Moon and Mars. It is a “man- 
ager of managers.” Such software systems are very com- 
plex; however, they are needed to make extended and com- 
plex systems humanly manageable. One of its greatest val- 
ues is that the data are collected at a central location, and 
automatic fault management processes may be introduced, 
expanded, and modified as experience is gained with the 
network. 
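
The three-layer hierarchy described above, combined with delegated control authority, can be pictured as a chain of managers in which each level handles minor events locally and forwards only the more severe ones upward. The sketch below is a hypothetical illustration: the class names, the severity scale, and the escalation thresholds are all assumptions made here, not part of any standard or of the demonstrated system.

```python
from dataclasses import dataclass, field


@dataclass
class Event:
    source: str
    description: str
    severity: int            # hypothetical scale: 1 = informational ... 5 = critical


@dataclass
class Manager:
    name: str
    escalate_at: int         # events at or above this severity go to the parent
    parent: "Manager | None" = None
    log: list = field(default_factory=list)

    def report(self, event):
        """Record the event locally, then escalate it if it is severe enough."""
        self.log.append(event)
        if self.parent is not None and event.severity >= self.escalate_at:
            self.parent.report(event)


def build_demo():
    """Element manager -> network manager -> integrated network manager."""
    integrated = Manager("integrated network manager", escalate_at=6)
    mars_net = Manager("Mars network manager", escalate_at=4, parent=integrated)
    rover = Manager("rover element manager", escalate_at=2, parent=mars_net)
    return rover, mars_net, integrated


if __name__ == "__main__":
    rover, mars_net, integrated = build_demo()
    rover.report(Event("rover", "routine status", 1))      # stays local
    rover.report(Event("rover", "link degraded", 3))       # reaches the network manager
    rover.report(Event("rover", "link lost", 5))           # reaches the top level
    for m in (rover, mars_net, integrated):
        print(f"{m.name}: {len(m.log)} event(s)")
```

Keeping low-severity events at the remote node is one way to limit the management traffic that must cross a link with a long round-trip delay.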

B. Protocol Architecture 

A protocol architecture describes message formats for 
reporting management data and defines the managed ob- 
jects. The managed objects are defined in terms of a 
management information base (MIB). The MIB includes 
a methodology for registering, identifying, and defining 
managed objects. There are numerous network manage- 
ment architectures and protocols available commercially; 
however, only two can be considered “standard.” The first 
is the Simple Network Management Protocol (SNMP), de- 
veloped by the Internet Activities Board (IAB) for use 
in the Internet, the world’s largest public-access network; 
the second is the Common Management Information Ser- 
vice and Protocol (CMIS/CMIP) developed by the Inter- 
national Organization for Standardization (ISO) for use in 
Open Systems Interconnection (OSI) networks. 

1. Internet Standards. The Internet has grown, es- 
pecially in the last few years, as a result of the widespread 
availability of software and hardware supporting the In- 
ternet protocol suite. The suite includes such protocols 
as the Transmission Control Protocol (TCP) and the In- 
ternet Protocol (IP). The Internet Simple Network Man- 
agement Protocol was demonstrated in 1988 and has been 
implemented by a large (and increasing) number of ven- 
dors. Details about the protocol are described in Inter- 
net documents known as Requests for Comments (RFCs) 
that are referenced in the following section. At the present 
time, virtually every new commercial router and bridge in- 
corporates an agent based on the SNMP protocol. Many 
computer companies, including Sun and Hewlett Packard, 
have included an SNMP agent in their operating system 
software, which, in addition to managing communications- 
related objects, also enables management of computer re- 
sources, such as disk storage availability and central pro- 
cessing unit utilization. 

a. SNMP. The SNMP architectural model involves
a network management station and a collection of net- 
work elements and remote probes (Fig. 4). The network 
management station monitors and controls network ele-
ments. Network elements are devices, such as worksta-
tions, routers and the like, which have software agents.
Agents perform the functions requested by a remote net- 
work management station, and act solely on the elements 
they reside in. Probes passively monitor the network me- 
dia and measure such characteristics as total throughput 
and efficiency. Management stations and network elements 
communicate using the SNMP message protocol. 

Probes are a relatively new extension to the SNMP 
agent architecture. Currently these probes are available 
for Ethernets, and they passively collect statistics and his- 
torical information from the network. 

Monitoring of the network state at any significant level 
of detail is normally accomplished by polling for appro- 
priate information by the network management station. 
A spontaneous message (called a “Trap”) is used by the
agent to notify a network manager of abnormal conditions. 
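
The division of labor between polling and traps can be sketched as follows. This is a minimal in-memory illustration, not an SNMP implementation: the `Agent` and `ManagementStation` classes, their method names, and the sample variable values are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Agent:
    """Hypothetical software agent residing in one network element."""
    name: str
    variables: dict = field(default_factory=dict)
    station: "ManagementStation | None" = None

    def get(self, var):
        # Answer a poll from the management station.
        return self.variables[var]

    def link_failed(self, interface):
        # Abnormal condition: notify the station without waiting to be polled.
        self.variables["ifOperStatus"] = "down"
        if self.station is not None:
            self.station.receive_trap(self.name, f"linkDown {interface}")


@dataclass
class ManagementStation:
    """Hypothetical network management station."""
    agents: list = field(default_factory=list)
    traps: list = field(default_factory=list)

    def register(self, agent):
        agent.station = self
        self.agents.append(agent)

    def poll(self, var):
        # Routine monitoring: ask every registered agent for the same variable.
        return {agent.name: agent.get(var) for agent in self.agents}

    def receive_trap(self, source, message):
        self.traps.append((source, message))


station = ManagementStation()
router = Agent("router-1", {"ifOperStatus": "up"})
station.register(router)

before = station.poll("ifOperStatus")   # state seen by routine polling
router.link_failed("eth0")              # agent sends a spontaneous trap
after = station.poll("ifOperStatus")    # next poll confirms the change
```

In this sketch the trap only alerts the station; the detailed state is still read by polling, which matches the trap-directed polling style described above.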

b. MIB. In the SNMP view, the objects to be managed
are identified in a management information base. The MIB
is a virtual store, that is, a concept that identifies all the
objects that need to be managed in the network along with 
their parameters. In its actual implementation, the values 
of the MIB objects may be locally stored in the element 
and then reported upon request to the network manager 
and stored in its database. The organization and structure 
of the MIB is described in RFC 1155 [2]. 

The MIB defined by the Internet community has over 
100 formal objects, called the common MIB [3]. The MIB 
has recently been extended (MIB II); the examples drawn 
in this article relate to the common MIB. A typical device 
may also have an additional 100 to 200 objects that have 
been defined in private or experimental MIB space. This 
extensibility could, for example, be applied to manage el-
ements processing Consultative Committee for Space Data
Systems (CCSDS) protocols.

The common MIB is organized into eight object groups 
(Table 2). While details of these groups are fully described 
in RFC 1213 [4], Table 3 presents an excerpt from the 
Interfaces Group to illustrate the concept. Many of the 
objects are similar to what would be managed in the Earth 
domain of the primary network. 
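
The way managed objects are registered and identified can be illustrated with object identifiers (OIDs). The sketch below assumes the standard Internet MIB prefix 1.3.6.1.2.1 and the group and Interfaces Group column numbers assigned in the Internet-standard MIB; the `describe` helper and the dictionary layout are illustrative inventions, not part of any SNMP library.

```python
MIB_ROOT = (1, 3, 6, 1, 2, 1)  # iso.org.dod.internet.mgmt.mib

# The eight object groups of the common MIB, keyed by group number.
GROUPS = {
    1: "system", 2: "interfaces", 3: "at", 4: "ip",
    5: "icmp", 6: "tcp", 7: "udp", 8: "egp",
}

# Selected Interfaces Group columns under ifTable(2).ifEntry(1).
IF_ENTRY = {
    1: "ifIndex", 2: "ifDescr", 3: "ifType", 4: "ifMtu",
    5: "ifSpeed", 7: "ifAdminStatus", 8: "ifOperStatus",
}


def describe(oid: str) -> str:
    """Map a dotted OID string to a readable name (illustrative helper)."""
    parts = tuple(int(p) for p in oid.split("."))
    if parts[:len(MIB_ROOT)] != MIB_ROOT or len(parts) <= len(MIB_ROOT):
        return "outside the standard MIB (e.g., private or experimental)"
    rest = parts[len(MIB_ROOT):]
    group = GROUPS.get(rest[0], "unknown group")
    if group == "interfaces":
        if rest[1:] in ((1,), (1, 0)):
            return "interfaces.ifNumber"
        if len(rest) >= 4 and rest[1:3] == (2, 1) and rest[3] in IF_ENTRY:
            instance = ".".join(str(n) for n in rest[4:])
            suffix = f" (interface {instance})" if instance else ""
            return f"interfaces.{IF_ENTRY[rest[3]]}{suffix}"
    return group
```

For example, `describe("1.3.6.1.2.1.2.2.1.8.3")` names the operational status of interface 3, while an OID under 1.3.6.1.4.1 falls in private MIB space, where DSN- or CCSDS-specific objects could be registered.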

c. SNMP messaging. The writing (setting) and reading 
(getting) of variables in an agent is accomplished through 
the use of the SNMP message. This protocol is described 
in RFC 1157 [3]. SNMP models all management agent 
functions as alterations or inspections of variables. 

The SNMP message is contained in an SNMP protocol 
data unit (PDU). There are five types of SNMP PDUs:
GetRequest, GetResponse, GetNextRequest, SetRequest,
and Trap, and they are described in Table 4. The User 
Datagram Protocol is used to deliver the SNMP PDUs. 
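
How an agent services these PDUs can be sketched with a toy, in-memory agent. The numeric tags follow the SNMP version 1 PDU numbering; everything else (the `handle` and `walk` functions, the tuple used as a response, and the sample variable values) is a hypothetical simplification that omits encoding, community strings, and UDP transport.

```python
from enum import IntEnum


class PDU(IntEnum):
    # Tags assigned to the five SNMP version 1 PDU types.
    GET_REQUEST = 0
    GET_NEXT_REQUEST = 1
    GET_RESPONSE = 2
    SET_REQUEST = 3
    TRAP = 4


def oid_key(oid):
    """Turn '1.3.6.1' into (1, 3, 6, 1) so OIDs sort numerically."""
    return tuple(int(part) for part in oid.split("."))


def handle(mib, pdu_type, oid, value=None):
    """Toy agent: answer one request with a (GET_RESPONSE, oid, value) tuple."""
    if pdu_type == PDU.GET_NEXT_REQUEST:
        # Return the first variable that follows `oid` in MIB order.
        later = sorted((k for k in mib if oid_key(k) > oid_key(oid)), key=oid_key)
        if not later:
            return (PDU.GET_RESPONSE, oid, "noSuchName")
        return (PDU.GET_RESPONSE, later[0], mib[later[0]])
    if pdu_type in (PDU.GET_REQUEST, PDU.SET_REQUEST):
        if oid not in mib:
            return (PDU.GET_RESPONSE, oid, "noSuchName")
        if pdu_type == PDU.SET_REQUEST:
            mib[oid] = value  # a "set" is an alteration of a variable
        return (PDU.GET_RESPONSE, oid, mib[oid])
    raise ValueError("a toy agent does not accept this PDU type")


def walk(mib, prefix):
    """Retrieve a table column by repeated GetNextRequest PDUs."""
    rows, oid = [], prefix
    while True:
        _, oid, value = handle(mib, PDU.GET_NEXT_REQUEST, oid)
        if value == "noSuchName" or not oid.startswith(prefix + "."):
            return rows
        rows.append((oid, value))


# ifAdminStatus (column 7) and ifOperStatus (column 8) for two interfaces.
mib = {
    "1.3.6.1.2.1.2.2.1.7.1": 1,
    "1.3.6.1.2.1.2.2.1.7.2": 1,
    "1.3.6.1.2.1.2.2.1.8.1": 1,
    "1.3.6.1.2.1.2.2.1.8.2": 2,
}
admin_column = walk(mib, "1.3.6.1.2.1.2.2.1.7")
set_reply = handle(mib, PDU.SET_REQUEST, "1.3.6.1.2.1.2.2.1.7.2", 2)
```

The `walk` helper shows why GetNextRequest simplifies table retrieval: the manager does not need to know in advance how many interfaces the element has.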

d. Remote monitoring MIB. The remote monitoring 
(RMON) MIB is an extension of the SNMP MIB that ap- 
plies to probes. The RMON MIB for Ethernet networks 
has been the first to be standardized [5]. It is intended 
that future versions of the RFC will define extensions for 
other network types. 

2. International Standards. International stan- 
dards have also been developed by ISO for the manage- 
ment of OSI-based networks. The OSI management frame- 
work is more elaborate than SNMP, but is similarly de- 
signed to control, coordinate, and monitor network re- 
sources. Although the development of the OSI network 
management architecture is close to completion, there are 
very few OSI agents implemented at the present time. 

The United States Government has mandated the use 
of selected OSI protocols for Federal information systems 
in the Government OSI Profile [6]. A transition to these
protocols is planned for NASA administrative systems 
[7]. Another OSI-related profile, the Government Network 
Management Profile [8], has been proposed for Federal 
systems that specifically focuses on network management. 
The impact of these profiles on stimulating development of 
OSI network management capability remains to be seen. 

OSI management protocols are generic and may be 
used in any OSI command and control environment (not 
just network management). The management structure 
is defined in ISO 7498-4 [9]. In the OSI architecture, 
when an application process, such as an agent or manager, 
needs to exchange information and commands with an- 
other application process, it makes use of software known 
as the Common Management Information Service Element 
(CMISE). The CMIS standard [10] defines the service that 
the CMISE provides and CMIP [11] defines the protocols 
that it uses. 

CMIP and CMIS standards were published in May 
1990. They provide a flexible framework for the con- 
trol and exchange of management information. Together, 
CMIS and CMIP define the bulk of the OSI network man- 
agement protocol. Revised versions of CMIS and CMIP 
(CMIS/CMIP Version 2) were published in January 1992.
With this update, CMIS and CMIP are expected to remain 
stable for a number of years. 

a. OSI management structure. The requirements for
OSI network management are grouped into the following
five major functional areas: (1) fault management, (2)
accounting management, (3) configuration management,
(4) performance management, and (5) security manage- 
ment. While the scope is very impressive, a commercial 
product that implements these capabilities is not currently 
available. 

In brief, fault management encompasses fault detection, 
isolation, and correction of abnormal operation of the OSI 
environment. Accounting management enables charges to 
be established for the use of resources in the OSI envi- 
ronment, and for costs to be identified. Configuration 
management includes functions to change the configura- 
tion of the system, set the parameters that control the 
routine operation of the system, and initialize and close 
down managed objects. 

Performance management includes functions to deter- 
mine system performance under natural and artificial con- 
ditions, to gather statistical information, and to maintain 
and examine historical logs. The purpose of security man- 
agement is to support the application of security policies. 

b. CMIS. The major difference between SNMP and
OSI architectures is that while SNMP assumes an agent- 
manager relationship, OSI does not assume any manage- 
ment hierarchy. In fact, peers may communicate among 
themselves using CMIS/CMIP. Any necessary hierarchy 
is imposed by the management architects. This should 
simplify the development of an integrated network man- 
agement (manager-to-manager) system. 

CMIS management operations include a number of ser- 
vices that are summarized in Table 5. Note that “gets” 
and “sets” are similar in function to their SNMP counter- 
parts. 
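
The confirmed and non-confirmed modes listed in Table 5 can be captured in a small lookup. The sketch below is only an illustration of those rules; the `request` function and the dictionary it returns are hypothetical and do not correspond to any real CMISE programming interface.

```python
# Modes permitted for each CMIS management operation service (per Table 5).
CMIS_MODES = {
    "M-GET": {"confirmed"},
    "M-SET": {"confirmed", "non-confirmed"},
    "M-ACTION": {"confirmed", "non-confirmed"},
    "M-CREATE": {"confirmed"},
    "M-DELETE": {"confirmed"},
}


def request(service, mode="confirmed"):
    """Validate a service request and report whether a reply is expected."""
    if service not in CMIS_MODES:
        raise KeyError(f"unknown service: {service}")
    if mode not in CMIS_MODES[service]:
        raise ValueError(f"{service} may not be requested in {mode} mode")
    return {"service": service, "mode": mode, "reply_expected": mode == "confirmed"}


get_req = request("M-GET")
set_req = request("M-SET", "non-confirmed")
```

Unlike an SNMP set, which is always acknowledged with a GetResponse, an M-SET issued in non-confirmed mode expects no reply.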

c. CMIP. CMIP is a general-purpose protocol that
supports the services defined by CMIS. CMIP, in turn, 
requires specific support from several relatively hidden 
OSI protocols. For example, the association services, 
M-INITIALIZE, M-TERMINATE, and M-ABORT, are 
supported by CMIP by invoking the Association Control 
Service Element defined in ISO 8649. The notification and 
operation services require use of the Remote Operations 
Service Element defined in ISO 9072-1. These complexi- 
ties are characteristic of OSI implementations. 

3. Summary of Data Architectures. The key 
words that summarize SNMP are “short-term” and “sim- 
ple.” The protocol was implemented within a few months. 
Today the protocol is almost universally implemented in 
network components, such as routers, and in certain com-
puters. However, additional extensions are being developed
in a newer version, SNMP Version 2, such as (1) an au-
thentication scheme to ensure security and filter out mes-
sages that may cause catastrophic errors and (2) the means
to support an integrated network management system at 
a higher level. 

As has been mentioned earlier, few, if any, OSI network 
management agents have been implemented in commercial 
equipment. However, there is strong bureaucratic interest 
in its adoption. The Federal Government promotes it and 
targets integrated network management systems as the key 
beneficiary. 

IBM announced in March 1992 [12] that it plans to use 
CMIP to send information from its Advanced Peer-to-Peer 
Networking Network Nodes to Netview (the IBM network 
management system). This application of CMIP does not 
use OSI transport protocols; it runs over a traditional IBM 
Systems Network Architecture protocol stack and is thus 
a “hybrid” application.

The Internet community has also proposed a hybrid 
CMIP implementation. The implementation, called CMIP 
over TCP/IP or CMOT, has been implemented by a few 
vendors in their network manager software (not in any 
agents) and runs over Internet transport protocols [13]. 

As systems grow to include large numbers of monitored 
objects and subsystems, any standard evolvable manage- 
ment architecture remains a challenge. Fault management 
will be easier to scale because it usually operates on an 
exception basis. Performance and configuration manage- 
ment are more likely to initiate periodic reporting and cre- 
ate more traffic as the number of objects increases. The 
Space Management Network needs prototyping and anal- 
ysis in this area. 
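
The scaling argument can be made concrete with a rough calculation. The sketch below compares periodic reporting, whose load grows with the number of objects, against exception reporting, whose load depends only on how often faults occur. The message size, polling interval, and fault rate are assumed figures chosen for illustration; they are not measurements from the DSN or from this article.

```python
def polling_load_bps(n_objects, bytes_per_exchange, interval_s):
    """Average load, in bits per second, when every object is polled periodically."""
    return n_objects * bytes_per_exchange * 8 / interval_s


def exception_load_bps(events_per_hour, bytes_per_report):
    """Average load, in bits per second, when reports are sent only on exceptions."""
    return events_per_hour * bytes_per_report * 8 / 3600


# Assumed figures: 100-byte exchanges, 10-second polling, 36 fault reports per hour.
periodic = {n: polling_load_bps(n, 100, 10) for n in (100, 1000, 10000)}
exceptional = exception_load_bps(36, 100)
```

Under these assumptions the periodic load rises tenfold each time the object count rises tenfold, while the exception load stays fixed at a few bits per second.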

The present application software interfaces have limi- 
tations. There is no sense of time other than “now.” This 
makes it impossible to directly issue queries for histori- 
cal information, or to issue scheduled command requests; 
these queries must be made through user-developed appli- 
cation software. 

C. Connectivity 

Connectivity between the element managers and the 
network management system may be over the primary 
data network or over dedicated links. An advantage of 
dedicated out-of-band circuits is that when the primary 
medium becomes unavailable because of congestion or mal-
functioning communications equipment, management sys-
tems can still determine and resolve the problem.

Path diversity is a major consideration in providing a
wide range of options for the design of robust networks.
Both SNMP and CMIP/CMIS have control messages that
can be used to select alternate paths if the initial path is
blocked or highly congested.

D. Human Interfaces 

The human interface provides a location for manage- 
ment control of the primary network. It should include a 
standard presentation format at the user and the systems 
level. It should also include the availability of artificial 
intelligence to assist the managers. 

A variety of technologies are expected to provide im- 
proved methods for allowing users to interface to com- 
puter systems. These interface technologies focus on im- 
provement of the amount of information that the user can 
perceive from a given interface configuration. Graphic vi- 
sualization and interactive displays are two particularly 
helpful technologies. Graphic visualization may be used 
to represent multidimensional data on computer graphics 
displays in images and in a form that allows people to per- 
ceive, amplify, and interpret the data. Animated models 
may be used for this interface. 

Interaction between the user and the data will facilitate 
fault resolution. Transformations and algorithms may be 
used to explore the effects on the data. 

Hypermedia software technology enables a user to re- 
trieve data in various formats in one or more display win- 
dows. The formats may include text, graphics, animation, 
digital audio, and video. Hypermedia lends itself to brows- 
ing and searching knowledge bases. 

Artificial intelligence (AI) techniques that help trans- 
late raw data into knowledge may be applied — such as the 
technique of context sensitivity to filter data using dy- 
namic thresholding. Also, AI techniques may be used to 
abstract information and present summaries to the user. 
AI can provide techniques for knowing what state the en- 
tire system is in and how ongoing activities are expected 
to affect that state. 
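
One simple reading of dynamic thresholding is an alarm limit that follows recent behavior instead of a fixed value. The sketch below is an assumption about what such a filter might look like, not a technique specified in this article: it flags a sample when it exceeds the mean of the previous `window` samples by more than `k` standard deviations. The function name and parameters are hypothetical.

```python
from collections import deque
from statistics import mean, pstdev


def dynamic_threshold_alarms(samples, window=5, k=3.0):
    """Return the indices of samples that exceed a limit derived from recent history."""
    history = deque(maxlen=window)
    alarms = []
    for index, value in enumerate(samples):
        if len(history) == window:
            # The limit adapts to the context set by the last `window` samples.
            limit = mean(history) + k * pstdev(history)
            if value > limit:
                alarms.append(index)
        history.append(value)
    return alarms


# Error counts per polling interval; only the jump to 40 stands out.
error_counts = [10, 11, 10, 12, 11, 11, 40, 12]
alarms = dynamic_threshold_alarms(error_counts)
```

In this simple form the outlier itself enters the history and temporarily raises the limit; a production filter would likely need to exclude flagged samples from the baseline.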

IV. Space Management Architecture 

A. Management Model 

A suggested architecture to support the First Lunar 
Outpost is illustrated in Fig. 5. Figure 5(a) illustrates 
the primary network and Fig. 5(b) illustrates the SMN, 
pointing out the operations centers. The SMN hierarchy 
has four, rather than three, tiers (Fig. 5(c)). At the high- 
est level is an Integrated Operations Center (IOC) for the 
highest level overview of the status of the extended net-
work. At the next lower level are two major operations
centers: the first is an Earth Operations Center (EOC), to
support the Earth and Space domains, and the second is 
a Moon Operations Center (MOC), an on-site facility to 
support the Moon domain. At a later time, a Mars Op- 
erations Center will be required. At the next lower level 
are facility managers that provide network management 
for a limited number of large facilities, such as the DSN. 
At the lowest level are numerous element managers; these 
managers oversee individual network components, such as 
bridges, routers, and computers. 

1. Integrated Network Manager. An integrated 
network manager will provide high-level coordination and 
security management. It will primarily communicate with 
the EOC and MOC and sometimes coordinate activities. 
The IOC may serve as an alternate EOC in the event of 
an emergency. This situation must be jointly reported for 
action. The action may be to jointly modify the forward 
error correction coding algorithm to increase the level of 
coding. 

2. Operations Managers. Operations management 
systems will be required at the EOC and MOC where 
global fault, configuration, performance, and security is- 
sues must be reconciled. A possible role could be resolv- 
ing an event, such as a severed fiber trunk that causes sud- 
den communications outage between the DSN and Mission 
Control. Facility managers will report the problem to the 
EOC for action and the EOC may automatically reroute 
the signals over a diverse path. 

3. Facility Managers. The facility managers oversee 
all the elements associated with their facilities. At this 
layer there is substantial fault tolerance — usually through 
the ability to manually and automatically reconfigure ac- 
tive elements. Potential facilities include the lunar main 
base, DSN, Mission Operations, Science Operations, and 
Network Operations. A possible role could be detection of 
a security threat, e.g., a persistent hacker. Such a problem
would normally be elevated to the IOC for action. 

4. Element Managers. The proposed SMN architec- 
ture has many element managers at each of the facilities. 
The element managers normally communicate with the fol- 
lowing: (1) a facility manager to coordinate a response to 
local events, such as reboots and power outages; (2) a lo- 
cal human interface for local maintenance; and (3) a local 
database to store MIB parameters. The element should 
have fault tolerance through its own internal design. 
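
The four-tier reporting chain can be sketched as a simple tree in which each manager knows only its parent. The `Manager` class, its `escalate` method, and the instance names below are hypothetical; the tier names follow the element, facility, operations, and integrated levels described above.

```python
from dataclasses import dataclass


@dataclass
class Manager:
    """Hypothetical node in the management hierarchy."""
    name: str
    tier: str  # "element", "facility", "operations", or "integrated"
    parent: "Manager | None" = None

    def escalate(self, handler_tier):
        """Return the chain of managers a report passes through to reach a tier."""
        path, node = [self.name], self
        while node.tier != handler_tier:
            if node.parent is None:
                raise ValueError(f"no {handler_tier} manager above {self.name}")
            node = node.parent
            path.append(node.name)
        return path


ioc = Manager("IOC", "integrated")
eoc = Manager("EOC", "operations", ioc)
dsn = Manager("DSN facility manager", "facility", eoc)
router = Manager("DSN router", "element", dsn)

# A severed trunk is resolved at the operations level.
fault_path = router.escalate("operations")
# A security threat detected by a facility is elevated to the IOC.
security_path = dsn.escalate("integrated")
```

Each report travels only as far up the tree as the tier responsible for it, which is what keeps traffic toward the IOC low.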

B. Data Architecture 

The primary architecture requirement is that the el-
ement managers incorporate the same standard manage-
ment agents. The current popular standard is, of course,
SNMP. Assuming that it will evolve into a more secure
protocol with peer-to-peer communication capability, it is 
a leading candidate to prototype the higher management 
layers for the Space Management Network. A simple tran- 
sition can be made later to CMIP, if such a transition is 
advantageous. 

Network management can be implemented in the near 
term by specifying SNMP in all Earth-domain network 
elements — especially the DSN ground systems. The MIB 
can also be extended to include OSI and CCSDS protocol 
performance. 

C. Connectivity 

A major issue in the design of the SMN is the design 
of the physical network. The telecommunications industry 
has been migrating toward out-of-band or “common chan- 
nel” signaling, a technique of putting management data 
into its own data channel separate from the primary data. 
In the case of LANs, this would be a separate LAN; in 
the case of WANs, it would be either a diverse circuit or 
a dedicated radio channel. In the space domain, the man- 
agement data path may be a dedicated radio channel. 

Another issue is the use of redundant communication 
paths for management data. A leased circuit may be iden- 
tified for nominal conditions in the Earth domain. As a 
backup, an alternate path, such as a dial-up circuit, may 
be used to retrieve management data. 
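
The nominal-plus-backup arrangement amounts to trying paths in order of preference. A minimal sketch follows, in which the `select_path` function, the path names, and the availability check are all hypothetical stand-ins for whatever link monitoring the real system would provide.

```python
def select_path(paths, is_available):
    """Return the first available path from a preference-ordered list, else None."""
    for path in paths:
        if is_available(path):
            return path
    return None


paths = ["leased circuit", "dial-up circuit"]

nominal = select_path(paths, lambda p: True)
# Simulate an outage on the leased circuit.
down = {"leased circuit"}
backup = select_path(paths, lambda p: p not in down)
```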

The bandwidth required for management data will nor- 
mally be relatively low. To minimize cost, bandwidth-on- 
demand is an important technology for exploration mis- 
sions for both the primary network and the management 
network. 

D. Human Interfaces 

Graphic visualization should be used to present multidi- 
mensional data in images and in a form that allows people 
to perceive, amplify, and interpret the data. Of particu- 
lar concern is the configuration and status of the primary 
network. Animated models and interactive displays may 
also be used. 

Hypermedia software technology should enable a user 
to retrieve data in various formats in one or more display 
windows. The formats may include text, graphics, anima- 
tion, digital audio, and video. 

Artificial intelligence techniques should also be applied 
in each of the operations centers to assist operations per- 
sonnel to recognize and diagnose problems and develop 
alternative solutions. 




V. Test-Bed Implementation 

A prototype network management system for the DSN 
was configured in the DSN 2000 Information Systems En- 
gineering Laboratory (Fig. 6). The management model 
was a simple two-tier hierarchy with a commercial network 
manager (SunNet Manager Version 1.2) in a dedicated Sun 
SparcStation IPX computer and element managers in each 
network element (routers and computers). The data archi- 
tecture was based on SNMP, and connectivity of the net- 
work manager to the elements was through two Ethernets 
joined by a serial circuit through commercial routers. 

SunNet has a graphical user interface based on Open- 
Look/X Windows. SunNet maintains a database of the 
network elements and it can be configured to periodically 
poll the elements for crucial information, such as the state 
of the interfaces. 

The information that is obtained by polling can be ex- 
amined with a textual browser or an elementary three- 
dimensional graphing package. The element database and 
polling capabilities of SunNet provide a low-level manage- 
ment station capability that can be extended by other ven- 
dors or by the user with additional effort. 

The SunNet Manager software package included addi- 
tional SNMP agents that could be installed on any Sun 
SparcStation. In addition to these agents, the labora- 
tory had access to a public-domain SNMP agent, available 
from Carnegie Mellon University (CMU), which was use- 
able (with minor modifications) on any Unix workstation. 
The CMU agent included source code, which allowed ex- 
tension of the agent to monitor private variables on the 
Sun workstations. SunNet Manager can be configured to 
recognize and manage these extra variables. 

Using SunNet Manager as a manager of the two-tier 
prototype network revealed several limitations in its use for
Facility and Operations Center management; these limits 
may be addressed by additional application software or 
future SunNet Manager upgrades. 

A relational database with number manipulation abili- 
ties would have been useful in analyzing the average traffic 
flowing over an interface and helpful in characterizing its 
behavior. A statistics program would have helped in ana- 
lyzing data and determining what a “reasonable” number 
of errors on an interface would be. Ideally, a manage- 
ment station should incorporate an expert system with 
knowledge about the network’s configuration, so that it 
can anticipate problems, suggest solutions, or automate 
this process. SunNet did not have these capabilities. 

Another benefit that a more advanced network man-
agement station should offer is “intelligent” polling, which
can reduce the bandwidth required for network manage-
ment. Under normal conditions, the network manager
would request only a small number of management vari- 
ables. If something anomalous were to be detected, the 
station could check additional variables to determine if a 
real problem exists. 
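
The two-stage behavior described above can be sketched as follows. SunNet Manager did not offer this capability, so the code is purely illustrative: `intelligent_poll`, the `read` callback, the anomaly test, and the choice of variables are assumptions.

```python
def intelligent_poll(read, routine_vars, detail_vars, is_anomalous):
    """Read a few routine variables; read the detail set only if something looks wrong."""
    values = {var: read(var) for var in routine_vars}
    if any(is_anomalous(var, value) for var, value in values.items()):
        for var in detail_vars:
            values[var] = read(var)
    return values


# Simulated agent state and a counter of requests sent over the link.
agent_state = {"ifOperStatus": "up", "ifInErrors": 0, "ifOutErrors": 0, "ifInDiscards": 0}
requests = []


def read(var):
    requests.append(var)
    return agent_state[var]


def is_anomalous(var, value):
    return var == "ifOperStatus" and value != "up"


routine = ["ifOperStatus"]
detail = ["ifInErrors", "ifOutErrors", "ifInDiscards"]

normal = intelligent_poll(read, routine, detail, is_anomalous)
normal_requests = len(requests)

agent_state["ifOperStatus"] = "down"
degraded = intelligent_poll(read, routine, detail, is_anomalous)
degraded_requests = len(requests) - normal_requests
```

Under normal conditions the cycle costs one request; the three additional reads are spent only when the routine variable indicates a problem.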

There were several conclusions: 

(1) Inexpensive commercial software exists today for do- 
ing network management on Internet-standard net- 
works. The managers are relatively inexpensive, 
ranging from $1,000 to $10,000, depending on the 
features and degree to which the software is bundled 
with a larger computer purchase. SunNet Manager 
was only $1,000. The element agents are inexpen- 
sive or come free; they are usually included in most 
routers and many computers. The SNMP agent soft- 
ware is also publicly available, and may be used with 
older Unix and MS-DOS workstations. 

(2) Limited bandwidth over certain links, such as the
DSN WAN circuits, requires that the volume of man-
agement traffic be minimized, and this was possible
to a certain extent with SunNet Manager by chang- 
ing the polling rate. In the prototype, the agents 
were polled every 8 sec. A four-level hierarchical 
management structure, such as that proposed in this 
article, would tend to minimize traffic even more be- 
cause each network manager would send only sum- 
mary data to higher level managers. 

(3) The element agents can be modified to support addi- 
tional MIB objects, such as those that would be DSN 
specific. These potentially include CCSDS protocol 
objects. SunNet Manager was highly configurable 
and easily adopted the new MIB objects. 

(4) The current SNMP standard does not explicitly in- 
clude manager-to-manager communications; how- 
ever, it would be possible for a user to develop this 
capability within the existing standard. SNMP Ver- 
sion 2, currently in the standards process, includes 
this ability. 

(5) It is possible to send management information over 
alternate circuits under the control of SNMP. In the 
laboratory, the computers and routers had diverse 
connectivity, which allowed management informa- 
tion to be transmitted over alternate Ethernets even 
if one specific network were down. This is an im- 
portant design consideration for the Space Manage- 
ment Network. A robust, reliable network must have 
such alternate connections available. With a net- 
work management system, failures can be detected 
and data can be transmitted over alternate routes 
either automatically or with operator consent.

VI. Conclusions

A network management architecture was proposed for 
future exploration missions. The architecture has four lay-
ers of management: (1) element management for local
monitoring and control of communications nodes, (2) facil-
ity management for managing major sites, (3) two global
operations centers to oversee Earth and Moon operations, 
and (4) an Integrated Operations Center to oversee end-to- 
end (e.g., lunar instrument-to-principal researcher) com- 
munications. 

The most widely available standard in commercial net- 
working hardware is the SNMP Internet standard. The 
standard is being upgraded by the IAB to include much 
of the functionality of its OSI counterpart, CMIP. Sub- 
stantial cost savings will result from using the standard in 
the Earth domain that has the widest implementation and 
lowest cost, and this appears to be SNMP at the present 
time. 

In the context of the end-to-end data system, the DSN
is one of several facilities in the architecture. Although the 
DSN is one of the most important elements, it currently 
lacks a standard network management capability. 

The following relevant issues remain: 

(1) DSN Network Management. The DSN facility has 
strategic importance as the primary Earth- and 
space-domain interface. A DSN management archi-
tecture, when implemented, could be used as a refer-
ence implementation for other facilities in the SMN.

(2) Messaging Standard. Manager-to-manager commu- 
nications need to be prototyped with a messaging 
technique, such as SNMP 2 or CMIP. 

(3) MIB Development. CCSDS standards are to be ap- 
plied to most future space missions. In order to man- 
age network processing and distribution of CCSDS 
packets, a CCSDS MIB is required. Based on the ex- 
perience of the Internet, the CCSDS MIB could be 
developed and tested in a CCSDS testbed, and then 
be permitted to evolve based on experience. Stan- 
dardization of the MIB with the CCSDS would be a 
necessary step. 

(4) First Lunar Outpost. A space management network 
for a specific proposed mission, such as the First 
Lunar Outpost, needs to be baselined in which spe- 
cific roles for element, facility, operations, and inte- 
grated management are defined. The requirements 
for fault, performance, configuration, and security 
management need to be defined. 

(5) Automated Functions. Automated functions need to 
be identified for selected remote activities, and then 
prototyped. This capability would demonstrate con- 
figuration management responsibilities of the SMN. 
An example could be to simulate alternate paths be-
tween major nodes (path diversity) and include au-
tomation as a means to select an optimal path.

Table 1. Domains and potential subnetworks.

Domains        Types of subnets
-------------  ----------------------------------------------
Earth          Earth surface-to-surface via wire/fiber
               Earth surface-to-surface via radio
               Earth surface-to-surface via satellite

Space (Moon)   Earth surface-to-Moon surface (main base)
               Earth surface-to-Moon surface (remote)

Space (Mars)   Earth surface-to-Mars surface (main base)
               Earth surface-to-Mars surface (remote)
               Earth surface-to-Mars relay satellite

Moon           Moon surface-to-surface via wire/fiber (local)
               Moon surface-to-surface via radio (remote)

Mars           Mars surface-to-surface via wire/fiber (local)
               Mars surface-to-surface via radio (remote)
               Mars surface-to-relay satellite
               Mars relay satellite-to-relay satellite


Table 2. SNMP managed object groups.

Object group                              Description
----------------------------------------  -------------------------------------------------
System                                    Objects that describe high-level characteristics
                                          of this network element

Interfaces                                Objects associated with the network interfaces to
                                          which this network element can communicate with
                                          IP datagrams

Address translation                       Translation table for converting an IP address
                                          into a subnetwork-specific (physical) address

Internet Protocol (IP)                    Subgroup of objects associated with IP

Internet Control Message Protocol (ICMP)  Subgroup of objects associated with ICMP

Transmission Control Protocol (TCP)       Subgroup of objects associated with TCP

User Datagram Protocol (UDP)              Subgroup of objects associated with UDP

Exterior Gateway Protocol (EGP)           Subgroup of objects associated with EGP


Table 3. Representative objects in the SNMP Interfaces Group.

Object         Definition                                                  Access
-------------  ----------------------------------------------------------  ----------
ifNumber       The number of network interfaces (regardless of their       Read-only
               current state) present on this system.

ifIndex        A unique value for each interface. Its value ranges         Read-only
               between 1 and the value of ifNumber. The value for each
               interface must remain constant at least from one
               re-initialization of the entity’s network management
               system to the next re-initialization.

ifDescr        A textual string containing information about the           Read-only
               interface. This string should include the name of the
               manufacturer, the product name and the version of the
               hardware interface.

ifType         The type of interface, distinguished according to the       Read-only
               physical/link protocol(s) immediately “below” the network
               layer in the protocol stack.

ifMtu          The size of the largest datagram which can be               Read-only
               sent/received on the interface, specified in octets. For
               interfaces that are used for transmitting network
               datagrams, this is the size of the largest network
               datagram that can be sent on the interface.

ifSpeed        An estimate of the interface’s current bandwidth in bits    Read-only
               per second. For interfaces which do not vary in bandwidth
               or for those where no accurate estimation can be made,
               this object should contain the nominal bandwidth.

ifAdminStatus  The desired state of the interface.                         Read-write

ifOperStatus   The current operational state of the interface.             Read-only


Table 4. SNMP PDUs.

SNMP PDUs       Description
--------------  ------------------------------------------------------------------
GetRequest      At the initiation of this PDU by the sender, this PDU requests the
                current status of objects from the destination system.

GetResponse     In response to the GetRequest-PDU, the destination system returns
                the name and value of each object requested using this PDU.

GetNextRequest  This PDU is used to simplify the retrieval of successive variables
                in the MIB that are ordered in the form of a table (such as routing
                table data).

SetRequest      Upon receipt of this PDU, for each object named in the PDU, the
                corresponding value is assigned to the variable. The receiving
                station returns a GetResponse-PDU of identical form as an
                acknowledgment.

Trap            Upon receipt of the Trap-PDU, the data contents are passed to the
                application-level software for appropriate processing. Several
                generic traps include:
                (1) ColdStart Trap: the sending entity is reinitializing itself;
                    its implementation may be altered.
                (2) WarmStart Trap: the sending entity is reinitializing itself;
                    its implementation will not be altered.
                (3) LinkDown Trap: the sending entity recognizes a failure in one
                    of the communication links represented in its configuration.
                (4) LinkUp Trap: the sending entity recognizes that one of the
                    communication links represented in its configuration has
                    come up.


Table 5. Management Operation Services.

Service primitive  Description
-----------------  ---------------------------------------------------------------
M-GET              Request the retrieval of management information from a peer
                   CMISE-service-user. The service may be requested only in a
                   confirmed mode, and a reply is expected.

M-SET              Request the modification of management information by a peer
                   CMISE-service-user. The service may be requested in a confirmed
                   or a non-confirmed mode. In the confirmed mode, a reply is
                   expected.

M-ACTION           Request a peer CMISE-service-user to perform an action. The
                   service may be requested in a confirmed or a non-confirmed
                   mode. In the confirmed mode, a reply is expected.

M-CREATE           Request a peer CMISE-service-user to create another instance
                   of a managed object. The service may be requested only in a
                   confirmed mode, and a reply is expected.

M-DELETE           Request a peer CMISE-service-user to delete an instance of a
                   managed object. The service may be requested only in a
                   confirmed mode, and a reply is expected.




Fig. 1. Space Exploration Initiative communications architecture.

Fig. 2. Simplified schematic of primary network: (a) Moon primary space network and (b)
Mars primary space network.

Fig. 5. Network management architecture for the Moon: (a) Moon primary space network; (b) space management

TDA Progress Report 42-112


February 15, 1993 


Errata 


T. Y. Otoshi and M. M. Franco have submitted the following errata to “DSS-13 Beam Waveguide Antenna Fre-
quency Stability” that appeared in The Telecommunications and Data Acquisition Progress Report 42-110, vol. April-
June 1992, pp. 151-162.

In the discussion of wind direction on pages 152 and 153, the terms “front side” and “back side” of the antenna 
should be interchanged. Also, the phrase “blowing into the face” should be replaced with “blowing into the back.” 

The changes are made after discovering that the instruments at DSS 13 give wind direction as coming from instead 
of going towards an indicated direction. Figures 7 and 14 of the original article should be replaced by the ones that 
follow.